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Preface 



These proceedings contain papers presented at the 8th Discrete Geometry for 
Computer Imagery conference, held 17-19, March 1999 at ESIEE, Marne-la- 
Vallee. 

The domains of discrete geometry and computer imagery are closely related. 
Discrete geometry provides both theoretical and algorithmic models for the pro- 
cessing, analysis and synthesis of images; in return computer imagery, in its 
variety of applications, constitutes a remarkable experimentational field and is 
a source of challenging problems. 

The number of returning participants, the arrival each year of contributions 
from new laboratories and new researchers, as well as the quality and originality 
of the results have contributed to the success of the conference and are an in- 
dication of the dynamism of this field. The DGCI has become one of the major 
conferences related to this topic, including participating researchers and labo- 
ratories from all over the world. Of the 41 papers received this year, 24 have 
been selected for presentation and 7 for poster sessions. In addition to these, 
four invited speakers have contributed to the conference. 

The site of Marne-la-Vallee, just 20 min away from Paris, is particularly well- 
suited to hold the conference. Indeed, as a newly built city, it showcases a great 
amount of modern creative architecture, whose pure lines and original shapes 
offer a favorable context for the topic of Geometry. 

We wish to thank ESIEE for providing the facilities. We also wish to thank 
for their financial support: PRC-GDR ISIS and AMI from the French GNRS, 
the city of Marne-la-Vallee (EPAMARNE), the Universite de Marne-la-Vallee, 
the Ministere de la Recherche et de la Technologie, and more generally all the 
people who helped us organize the conference and made it possible. 



ESIEE, March 17, 1999 



Gilles Bertrand, 
Michel Gouprie, 
Laurent Perroton 
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Abstract. This paper introduces a dimension-independent multiresolu- 
tion model of a shape, called the Multi-Complex (MC) , which is based on 
decomposition into cells. An MC describes a shape as an initial cell com- 
plex approximating it, plus a collection of generic modification patterns 
to such complex arranged according to a partial order. The partial order 
is essential to extract variable-resolution shape descriptions in real time. 
We show how existing multiresolution models reduce to special cases of 
MCs characterized by specihc modification patterns. The MC acts as a 
unifying framework that is also useful for comparing and evaluating the 
expressive power of different approaches. 



1 Introduction 

Multiresolution geometric models support representation and processing of spa- 
tial entities at different levels of detail. Such representations have gained recently 
much of attention in the literature because of their potential impact on appli- 
cations, such as terrain modeling in geographic information systems, scientific 
data visualization, virtual reality, etc. The basis for a multiresolution geometric 
model is the decomposition of the shape it describes into simple elements, called 
cells. 

Cell complexes are used as discrete models of a variety of shapes in two, three 
or higher dimensions. For example, two-dimensional complexes made of polyg- 
onal cells are used for describing the boundary of solid objects. In particular, 
two-dimensional simplicial complexes (triangle meshes) are used for representing 
surfaces in computer graphics, and for representing terrains in geographic appli- 
cations. In solid modeling, three-dimensional complexes are used for describing 
the interior of an object as well as its boundary. d-Dimensional simplicial com- 
plexes are used as approximate representations of scalar or vector fields. 

The accuracy of a cell complex in representing a shape depends on the size, 
number, and density of its cells: a parameter that we call the resolution of the 
complex. A high resolution, and thus a high number of cells, is needed to pro- 
duce accurate descriptions. On the other hand, maximum accuracy is not always 
required in each part of a shape, but a sufficiently high accuracy for the specific 
application task can be achieved by locally adapting the resolution of a complex 
in different parts of the shape, thus reducing processing costs and memory space. 



G. Bertrand, M. Couprie, L. Perroton (Eds.): DGCI’99, LNCS 1568, pp. 3—18, 1999. 
(c) Springer- Verlag Berlin Heidelberg 1999 
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The process of building an adaptive cell complex from scratch is usually time- 
consuming. Sophisticated techniques are required, for instance, to decide where 
to place the vertices, how to optimize the shape of the cells, how to estimate and 
bound the approximation error. With a multiresolution approach, most of the 
time-consuming operations are performed off-line, in order to build a structure 
which can be queried efficiently on-line, according to the parameters specified 
by the application task. 

Several multiresolution models for cell complexes have been proposed in the 
literature, but the field still lacks a unifying framework since: 



— Most existing models are designed for specific applications, or classes of 
applications (i.e., virtual reality, rendering, terrain modeling) and designed 
for certain operations. 

— Existing models usually rely on a specific construction techniques: such mod- 
els can only obtained from an initial cell complex by applying a specific type 
of transformation operator. 

— Most proposed models are a direct abstraction of the data structure used to 
implement them. 

Here, we formally define a general multiresolution model of a shape based 
on cell complexes, called the Multi- Complex (MC), which extends the Multi- 
Triangulation, proposed in [Pup96,DFPM97]. The MC describes a partially or- 
dered set of modification patterns specifying how an initial coarse cell complex 
can be iteratively refined in order to obtain more and more accurate represen- 
tations. The partial order captures dependency relations between modifications 
that affect the same portion of the complex. This framework permits to select 
subsets of modification patterns which are consistent with the partial order: 
when such modification patterns are applied to the initial cell complex, a variety 
of complexes at different, variable resolutions can be obtained. 

The Multi-Complex is independent of the specific type of modification pat- 
tern. Existing multiresolution models reduce to special cases of MCs where modi- 
fication patterns have certain characteristics. In the MC framework, we deal with 
generic fc-dimensional cell complexes in which decompose a regular shape, 
and we allow more general modifications, including refining boundaries, changing 
the topological type, merging and splitting connected components. 

The remainder of the paper is organized as follows: In Section 2, we introduce 
the concept of modification pattern, an entity which describes a modification of 
a cell complex, and plays a fundamental role in the definition of the Multi- 
Complex. In Section 3, we define the Multi- Complex (MC) as a structure which 
organizes a partially ordered set of modification patterns in the form of a directed 
acyclic graph and we discuss its more relevant properties. In Section 4, we show 
how existing multiresolution models can be seen as special instances of a Multi- 
Complex and we discuss their properties in such a framework. Finally, Section 
5 presents some concluding remarks. 
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2 Preliminaries 

2.1 Background Notions 

A k-dimensional cell in is a subset of homeomorphic to a closed k- 
dimensional ball. The union as pointset of a set of cells F in E'^ is called the 
carrier of F. A cell complex is a finite set of cells F in E"^ of heterogeneous 
dimension such that their interiors are disjoint, their union cover the carrier, 
and the boundary of each cell is made of cells of lower dimensions belonging to 
the complex. A cell 7 in a complex A is a top cell if there is no other cell in F 
bounded by 7. A fc-dimensional cell complex is called regular if all its top cells 
are fc-dimensional cells. 

A special and interesting case is that of simplicial complexes (e.g., triangle 
and tetrahedral meshes): a k-dimensional simplex in E'^ is the locus of the points 
in E“^ that can be expressed as the convex combination of fc + 1 affinely indepen- 
dent points; a k-dimensional simplicial complex is a cell complex where all top 
cells are simplices, and every other cell is generated by an appropriate subset of 
vertices of some top cell. 

Thus, the basic elements of an approach to modeling a geometric shape 
through cell complexes are: 

— a regular cell complex F, where each top cell has an associated approxima- 
tion error, and where, in general, each cell may have application-dependent 
attributes associated with it; 

— possibly, a function /, piecewise defined on the top cells of F. 

For instance, (fc -I- l)-dimensional scalar fields are described by subdividing 
the fc-dimensional domain of the field into a regular fc-dimensional simplicial 
complex, and by associating an approximation of the field to each fc-dimensional 
simplex. 

We are interested in modeling at multiple resolutions the cell complex F 
underlying the model of a spatial entity, independently of the function / defined 
on its top cells and on the attributes of cells. 



2.2 Modifications of a Cell Complex 

A modification is an operation that replaces a cell complex Fold (possibly con- 
tained in a bigger complex F) with another cell complex F^ew The interesting 
case is when Fqm and Fnew are approximations at two different resolutions of 
the same (portion of a) shape. A modification pattern specifies a modification as 
the pair of cell complexes C = {Fold, Fnew) - In what follows, we use the symbols 
77 i(C) and Fl2{C) to denote the first and second element, respectively, of the 
pair of cell complexes in a modification pattern C. 

Applying a modification pattern C = {Fold, Fnew) to a cell complex T consists 
of replacing Fom with Fnew in F. The result of applying C to F, denoted with 
r[C], is the set of cells {F \ Fold) U Fnew- Note that set F[C] may not always be 
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a cell complex. We are interested in the case where r[C] is a cell complex, and 
C does not open or close holes in the boundary of F. This corresponds to the 
notion of validity of a modification pattern for a cell complex: 

Definition 1. Let F be a regular k-dimensional cell complex in and C = 
{Fold, Fnew) be a k-dimensional modification pattern in E^*. C is a valid modifi- 
cation pattern for F if and only if 

^oid C F (as sets of cells); 

2. the cell complex obtained by deleting Fold from F does not intersect F^ew 
outside the common cells of the boundaries of Fold an-d F^ewi 

3. the boundary of Fnew must match with the boundary of the “hole” created in 
F when removing Fold- 



pO 




hole 





Fig. 1. Six modification patterns on a two-dimensional cell complex: Ci and C 2 are 
valid; C 3 is not valid since it violates condition 2; C4 is not valid since it violates 
condition 3. 



Examples of valid and invalid modification patterns are shown in Figure 1. 
It has been proven that the result F[C\ of applying a valid /c-dimensional mod- 
ification pattern (7 to a regular fc-cell complex F is either empty, or is a regular 
fc-dimensional cell complex [Mag98]. 

For the purpose of defining a multiresolution model, we are interested in k- 
dimensional refinement patterns, i.e., modification patterns where Fnew contains 
more fc-cells than Fold, and in minimal modification patterns, i.e., modification 
patterns that cannot be split into two or more valid modifications to be be 
performed in a sequence. Modification patterns that induce changes in the topo- 
logical type of the shape (e.g., increasing or decreasing its genus, merging or 
splitting connected components) will be allowed. 

2.3 Sequences of Modification Patterns 

A sequence of modification patterns is valid for a cell complex Tq if and only if 
every modification pattern Ci in the sequence is valid for the complex obtained 
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as the result of successively applying to Iq all the patterns preceding Q in the 
sequence. 

We say that a modification pattern C directly depends on another modifi- 
cation pattern C (and, thus, C directly blocks C') if and only if the effect of 
applying C' is removing some of the cells introduced by applying C. In a set of 
modification patterns, two modification patterns depend on each other if they 
are in the transitive closure of the relation of direct dependency; in a similar way 
we can define when they block each other. 

In a generic valid sequence a cell 7 may be created and removed several times 
and this creates cycles in the relation of dependency. Thus, we define a modifi- 
cation pattern as non-redundant with respect to a set of modification patterns 
if it does not recreate fc-cells eliminated by some other modification pattern of 
the set. It is easy to verify that, in non-redundant and valid sequences, each 
cell either is in the initial complex, or there is exactly one modification pattern 
specifying its “creation”; moreover, each cell either is in the final complex, or 
there is exactly one modification pattern specifying its “deletion”. 

Even if two modification patterns are independent, they cannnot necessarily 
be applied to the same cell complex without interfering. We say that Ci and Cj 
are not in conflict when all the pairs of complexes UflCi) and IIi(Cj), UflCi) 
and Il2{Cj), Il2{Ci) and IIi{Cj), and Il2{Ci) and Il2{Cj) intersect at most 
in a subset of their boundary cells, which are preserved in the modifications 
represented by Ci and Cj. This means that any of such pairs share the same 
subset of boundary cells, and the union of all their cells forms a complex. A set 
of modification patterns is thus conflict-free if and only if there are no conflicts 
between pairs modification patterns that are independent. 

3 The Multi-Complex and its Properties 

In a Multi-Complex, a partially ordered set of modification patterns of a cell 
complex is encoded in the form of a directed acyclic graph (DAG), where nodes 
represent modification patterns, and arcs represent relations of direct depen- 
dency between them. An MC is defined by abstracting over the relation of direct 
dependency from a valid, non-redundant, and conflict-free sequence of minimal 
refinement patterns. The non-redundancy of the sequence ensures that the direct 
dependency relation is a partial order. Since all nodes describe refinement pat- 
terns, the resolution monotonically increases while traversing the DAG according 
to the direction of its arcs. The validity and absence of conflicts ensure that, for 
every subset of modification patterns “closed” with respect to the dependency 
relation, the result of applying its elements, sorted in any total order consistent 
with the partial one, is a cell complex. 

Definition 2. Let Iq be a k-dimensional cell complex, Gi, 6 * 2 , . . . , C„ be a se- 
quence of k-dimensional modification patterns such that: 

— foreveryl < i < n,Ci is a refinement pattern, itisminimal, and IlflCi) yf 0; 

— Gi, G 2 , ■ . . ,Cn is a valid sequence for Iq; 
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— sequence Cq, Ci, C 2 , . . . ,Cn is non-redundant, where Cg = ( 0 , Fq); 

— set {Cl, . . . , Cn} is conflict-free. 

Then, a Multi-Complex (MC) is a directed graph A4 where: 

— A4 has n nodes, which are in one-to-one correspondence with the modification 
patterns of set [Cq, . . . , Cn}; 

— there is an arc (Ci,Cj) if and only ifCj directly depends on Ci. 

A simplicial Multi-Complex is a special case of MC where each modification 
pattern consists of a pair of simplicial complexes. The MC of Figure 2(a) is a 
simplicial MC. 




Fig. 2. (a) A Multi-Complex: each node is a modification pattern Ci, where UiiCi), 
and Il2{Ci) are the complex on the left side and on the right side, respectively; (b) its 
front cell complex. 



Given a Multi-Complex A4 , and a consistent sequence of its nodes (including 
the root Co) Co, Ci, . . . , C„ , the front cell complex of A4 is 

F{M) = n2{Co)[Ci]---[Cn]. 

The front cell complex is a regular cell complex, and it is uniquely defined because 
it is independent on the specific sequence considered [Mag98]. Figure 2(b) shows 
the front complex associated with the MC of Figure 2(a). 

A sub-MC Ai' of a Multi-Complex Ai identifies a subset of the modification 
patterns of AA which contains the root Co and is closed with respect to the 
dependency relation (i.e., it contains the parents of any of its nodes). Since the 
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MC-nodes represent refinement patterns, iF(Af') is a cell complex at a lower 
resolution than the front cell complex of j\4 (see Figure 3). Different sub-MCs of 
A4 provide cell complexes at different resolutions. Cell complexes at intermediate 
resolutions are front complexes of sub-MCs. 





(b) 

Fig. 3. A sub-MC containing nodes {0,3, 4, 5} (a), and its front cell complex (b). 



In what follows , we will discuss some properties of an MC which are relevant 
to the efficiency of its encoding data structures, and of the query algorithms op- 
erating on it, as discussed in [DFMP98,Mag98]. Such properties are determined 
by the construction algorithm used to produce the initial cell complex and the 
sequence of modification patterns which form the basis of a Multi-Complex. 

We define the height of a Multi-Complex as the maximum number of arcs 
in a path from the root to a leaf; the width of a node C as the number of top 
cells in 772 (C), and the width of a Multi-Complex as the maximum width of its 
nodes; the size of a Multi-Complex as the total number of top cells in it, and 
the size of the front cell complex of an MC as the number of its top cells. 

We are interested in Multi-Complexes which have a height logarithmic in the 
number of its top cells, and a width bounded from above by a small constant b. In 
[DFMP97,Mag98], we have shown that bounded width and logarithmic height 
are important for the efficiency of traversal operations on an MC required in 
query processing, such as a point location or a range query. 
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We say that an MC A4 has a linear growth if and only if there exists a positive 
real number b such that for every sub-MC Ai' of A4, the ratio between the size 
of A4' and the size of the front cell complex of Ai' is less or equal to b. It can 
be easily seen that if an MC has bounded width, then it has a linear growth as 
well. 

If an MC has a linear growth, then the size of any sub-MC is linear in the 
size of its front complex. In particular, the size of the MC itself is linear in 
the size of its front cell complex. This means that the multiresolution structure 
introduces only a linear storage overhead with respect to a simple cell complex 
at the maximum resolution. 

In [Pup96,Mag98], we have shown that a linear growth is fundamental to 
achieve optimal time complexity (i.e., linear in the output size) for algorithms 
which extract a cell complex at a given resolution from an MC: since these 
algorithms perform a DAG traversal to find the appropriate sub-MC, linear 
growth guarantees that the size of the traversed sub-MC is linear in the output 
size. 

4 Existing Multiresolution Models as Multi-Complexes 

In this section, we show how multiresolution models presented in the literature 
can be seen as special cases of Multi-Complexes. Each of these models gives a 
class of MCs characterized by certain properties of the modification patterns, 
and, consequently, by certain properties in terms of width, height and growth. 
Moreover, each existing model provides an implementation of an MC with a 
specific data structure. 

Often some information present in the MC is either lost or incorrectly repre- 
sented in some data structures. Examples of lost information are found in models 
which encode a linear sequence [Hop96,KS97] instead of the DAG; we will see 
examples of incorrectly represented information, for instance, in nested models. 
Such an aspect will be examined for existing models in Sections 4.1, and 4.2. For 
the sake of brevity, we explain the interpretation of just two-dimensional mul- 
tiresolution models as two-dimensional MCs. Higher-dimensional models can be 
interpreted in a similar way. 



4.1 Nested Models 

Nested models are based on a nested subdivision of a square or triangular do- 
main into scaled copies of it and are usually described by a tree, where each 
node corresponds to a complex which covers one of the cells of its parent node. 
Examples of nested models are the quadtree, the quaternary triangulation, the 
restricted quadtree, the hierarchies of right triangles and the hierarchical irreg- 
ular triangulations. Figure 4 (a) shows an example of a hierarchical irregular 
triangulation. 

In a fc-dimensional nested model, a node represents the refinement of a top- 
cell 7 of its parent node into a complex T'. The corresponding modification 
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pattern is C = ({7}, F'), where F is the complex contained in the parent node. 
In general (and in most existing nested models), C is not a valid modification 
pattern for F, since F' may split some of the {k— l)-facets of 7: the intersections 
of the cells oi F \ {7} with those of F' do not coincide with the common cells of 
their boundary complexes, thus violating the second condition in the definition 
of Multi-Complex. Figures 4 (a) and (b) illustrate this problem. A consistent 
cell complex is especially important when interpolating functions are defined 
over the cells, as in the case of scalar and vector fields. 




Fig. 4. (a) The tree representing a nested model, (b) The complex F' refining a top-cell 
7 does not define a valid modification pattern for the complex F containing 7; in fact 
the boundary of F' does not match with that of the adjacent cell to 7 in F. (c) Clusters 
of tree nodes that become MC-nodes when interpreting the nested model as an MC. 



If F' splits a (fc — l)-facet 7' of 7, this means that 7 can be replaced consis- 
tently with F' only if also the cell 71, adjacent to 7 along 7' in F, is replaced 
by a complex F[ splitting facet 7' in the same way. Thus, nodes in the tree 
representing the refinement of 7 and 7' must be clustered to form one node of 
the Multi-Complex. The result of such node clustering is shown in Figure 4 (c). 

A Multi-Complex representing a fc-dimensional nested model is obtained as 
follows: 

— the MC-nodes are obtained from the nodes of the tree by iteratively joining 
pairs of nodes corresponding to complexes in which two adjacent /c-cells 
have been refined by decomposing their common (k — l)-facet; this defines 
a mapping of nodes of the tree to MC-nodes. 

— the MC arcs are obtained from the tree arcs according to the following rule: 
for every arc (iVi,A^2) of the tree, if N\ maps to an MC-node C\ and N2 
maps to an MC-node C2, then arc (C'i,C'2) represents (A^i, A^2) in the MC; 
this defines a mapping of the arcs of the tree to arcs of the MC. 

The Multi-Complex derived from a given nested model shows what consistent 
cell complexes can be obtained from it (i.e., cell complexes that avoid cracks in 
the graph of a scalar or vector field represented by the nested model). Such 
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consistent cell complexes are in one-to-one correspondence with the sub-MCs 
of the corresponding Multi-Complex. The range of (possibly inconsistent) sets 
of cells obtained from a nested model is in one-to-one correspondence with the 
analogous sub-structures of the tree. Since several nodes of the tree are merged 
into one MC-node, it follows that some sub-structures of the tree are not captured 
by the MC: these are exactly those cuts leading to inconsistent representations. 

Since no knowledge about different tree nodes that must be clustered to form 
an MC-node is explicitly encoded in the tree, algorithms for extracting generic 
(i.e., variable resolution) complexes from nested models are more involved. 

Nested models based on the recursive subdivision of a domain according to a 
fixed pattern that splits every {k — l)-facet of the current /c-cell (e.g., quadtrees 
and their extensions [Sam90], quaternary triangulations [GG79]), correspond to 
Multi-Complexes where the DAG reduces to a list: all the nodes in a level of the 
tree must be merged together into one MC-node. Thus, the range of consistent 
complexes that can be obtained from such models reduces to one for every tree 
level. 

Nested models that leave some (fc — l)-facet of a /c-cell 7 unrefined when 
expanding 7 into a complex (e.g., hierarchical triangulations [SP92,DFP95], hi- 
erarchies of right triangles [DWS“*'97,LKR+96]) may have a better expressive 
power. In general, if (fc — l)-cells are split too often, then the nodes of the tree 
tend to be clustered into large MC-nodes, hence exhibiting a low expressive 
power. On the other hand, the persistence of (fc— l)-cells across the levels of the 
tree tends to produce silvery cells, which are not desirable. In two dimensions, 
the best compromise seems to have been achieved with hierarchies of right tri- 
angles, where, at each level, one edge of each triangle is split, and two edges are 
retained to be split at the next level (see Figure 5). 

4.2 Evolutionary Models 

Evolutionary models track the evolution of an initial cell complex /q through a 
sequence of local modifications, which is assumed to be a valid sequence for Fq. 

A Multi-Complex for such models represents the local modifications of the 
sequence in its nodes, and encodes their relations of direct dependency in its 
arcs: there is an arc from a node Ci = {ri,r[) to a node Cj = {rj,Fj) if and 
only if the intersection of T/ and Fj (as sets of cells) contains at least one top- 
cell. The validity and non-redundancy of the sequence ensures that the resulting 
graph is a DAG. 

In most existing models, the local modifications are performed by applying 
a specific operator (see Figure 6): 

- In [Hop96,XESV97,Hop97,MMS97,GTLH98,KCVS98] the refinement pat- 
tern is a vertex split, which expands a vertex p into an edge, two edges 
incident at p into two triangles, and warps the triangles surrounding p ac- 
cordingly. 

- In [DF89,CDFM+94,dBD95,BFM95,KS97,CPS97,DFMP98] the refinement 
pattern consists of inserting a new vertex p, and replacing a subset of trian- 
gles in the neighborhood of p with a set of triangles incident at p. Note that 
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Fig. 5. A hierarchy of right triangles (a), with meshes shown level by level, and the 
clusters of tree nodes corresponding the nodes of the Multi-Complex (b). 




VERTEX INSERTION VERTEX' SPLIT 



Fig. 6. Vertex insertion and vertex split. The shaded triangles are those involved in 
the modification pattern. 



a vertex split where the edge created by splitting p has p itself as one of its 
endpoints can be regarded as a special case of vertex insertion. 

In all models, the local modifications guarantee the non-redundancy and the 
minimality of the modifications. 

Evolutionary models proposed in the literature represent the local modifica- 
tions and/or the cells involved within them by using different structures, from 
sequential lists to DAGs. In the following we illustrate the relationship between 
those structures and the corresponding Multi-Complex. 



Sequential Models Sequential models [KS97,Hop96] keep the local modifica- 
tions in a totally ordered list, which is consistent with the partial order repre- 
sented in the Multi-Complex. Usually, such total order is either the same, or the 
reverse of the order in which the local modifications have been performed during 
the construction of the model. Dependencies between local modifications are not 
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(b) (a) 



Fig. 7. (a) A sequence of edge collapses, to be read reversed as a sequence of vertex 
splits. Shaded triangles correspond to the two meshes forming a modification pattern in 
the MC. Triangles marked with a circle are those considered in [Hop97] for determining 
the vertex dependencies, (b) The corresponding binary forest of vertices. 



explicitly stored, and thus they must be reconstructed on-line when extracting 
a complex at variable resolution by performing considerable backtricking. 

In terms of the corresponding MC, the models in [KS97] and [Hop96] do not 
guarantee bounded width, logarithmic height or linear growth. 

Hierarchies of Vertices Hierarchies of vertices identify a local modification 
(namely, a vertex split) with the vertex v being split. Thus, dependencies between 
modification patterns become dependencies between vertices. Most models (e.g., 
[Hop97,XESV97]) distinguish between two cases of dependency: 

1. if a vertex v is an endpoint of the edge created by splitting w, then v depends 
on w; 

2. V also depends on (some of) the vertices that are incident at v when v is 
split. 

Dependencies of type 1 are stored as a binary tree of vertices (every vertex split 
creates two new vertices, which become its children); dependencies of type 2 are 
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Fig. 8. (a) The Multi-Complex corresponding to the sequence of edge splits in Figure 
7; the arcs emanating from the root are shown dashed since the root does not represent 
a vertex splits, (b) The DAG representing vertex dependencies according to [XESV97]; 
(c) The DAG representing vertex dependencies according to [Hop97]; (d) The DAG 
representing vertex dependencies according to [MMS97]; redundant arcs are dashed, 
missing arcs are dotted and marked with ‘=’. 




Fig. 9. Edge collapse and its reverse vertex split. According to [XESV97], vertices vi 
and V 2 depend on the vertices of all triangles incident at v (i.e., wi, W 2 ,Pi,P 2 ,P 3 ,Pi,Pb)\ 
according to [Hop97], «i and V 2 depend on the vertices of the shaded triangles (i.e., 
wi,W2,pi,P3,P4,P5)’, according to [MMS97], vi and V 2 depend just on wi and W 2 - 



stored separately. The various authors define dependencies of type 2 in different 
ways. 

Xia, El-Sana, and Varshney [XESV97] consider a vertex v as depending on 
all the vertices that are adjacent to v when v is split. With respect to the Multi- 
Complex, the resulting DAG contains redundant arcs, which correspond to paths 
in the MC (see Figure 8 (b)). 

Hoppe [Hop97] considers a vertex v as dependent only of the vertices of the 
four facets incident at the two edges that are expanded into triangles when v is 
split (see Figure 9). However, this does not capture all the existing dependencies, 
and there are still redundant arcs (see Figure 8 (c)). 

Progressive TINs [MMS97] consider a vertex p as dependent of its parent in 
the binary tree, and of the other endpoints of the two edges that are expanded 
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into triangles when splitting/). Again, some dependencies are missing (see Figure 
8 (d)). 

The model by Luebke and Erikson [LE97] is based on modifications that are 
“generalized” vertex splits: a vertex may be split into two vertices not necessarily 
connected by a segment (such an operation is the inverse of not only edge collapse 
but also vertex-pair collapse, when the two collapsed vertices are not adjacent 
in the mesh) . This model keeps just the binary tree of vertices, and ignores all 
other existing dependencies. 

The problem of redundant arcs is intrinsic to the binary tree of vertices: a 
vertex split may depend only indirectly on its parent, since other modifications 
may have altered the configuration of triangles between the parent and the child. 
For instance, the binary tree shown in Figure 7 contains arc (6, 1), which is not 
in the corresponding MC shown in Figure 8. 

Extra arcs increase the space requirements of data structures but do not 
compromise the integrity of the model. On the contrary, missing arcs are a 
problem because DAG traversal algorithms working on such models may extract 
inconsistent cell complexes. 

A cell complex extracted from a model affected by missing dependency links 
[Hop97,LE97,MMS97] may contain intersecting triangles, and triangles which 
were do not belong to any modification pattern in the model. The problem of 
“extra” triangles is relevant in applications where it is necessary to know the 
approximation error of any triangle with respect to the original shape. 

Gueziec et at. [GTLH98] store vertex dependencies explicitly by means of a 
DAG, but they consider dependencies as in [XESV97], thus their DAG contains 
some redundant arcs. 

The models of [Hop97,GTLH98,MMS97] do not guarantee bounded width, 
logarithmic height or linear growth. The Multi-Gomplex corresponding to the 
model of [XESV97] has linear growth, and logarithmic height, but it does not 
necessarily have bounded width. 



5 Concluding Remarks 

In this paper, we have defined the Multi-Gomplex (MG), a general model for 
multiresolution cell complexes. An MG consists of a DAG that represents a 
partially ordered set of modifications refining an initial cell complex; the partial 
order is induced by the relation of direct dependency, where a modification 
that deletes some top-cells from the current complex directly depends on those 
modifications that have previously introduced such top-cells. 

Gell complexes at variable resolution can be obtained from an MG by select- 
ing a subset of modifications consistent with the partial order. 

We have shown that the Multi-Gomplex encompasses all existing multireso- 
lution models and helps understanding the range of cell complexes at variable 
resolution that a model can provide. Thus, the MG acts as a unifying framework 
that is useful for comparing the expressive power of different approaches. 
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The definition of the Multi-Complex, and of most other multiresolution mod- 
els developed in the literature is limited to regular cell complexes. However, in 
some applications it is important to deal with non-regular shapes, and modifica- 
tion patterns that change the dimensionality of (portions of) a shape. Extensions 
of the MC framework in such a perspective will be the subject of future research. 
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Abstract. In many applications, e. g. object recognition, decomposition 
of a shape is of great interest. We present a decomposition algorithm for 
3D shape that is based on a multiresolution structure. The shape is hi- 
erarchically decomposed according to local thickness. A merging process 
is introduced for merging of small components to more significant parts. 
As a side effect of the algorithm, we also obtain a way of smoothing noisy 
shapes. 



1 Introduction 

Shape representation is an essential part of image analysis, especially in object 
recognition. A shape can be represented in many different ways, for instance 
by using skeletonization or decomposition. It is commonly accepted that human 
shape perception is partly based on decomposition and also that it is a hierar- 
chical process. Therefore, automatic decomposition methods for volume images 
should also be hierarchical, and result in parts that are recognized as relevant 
parts. 

Several approaches to decompose 2D digital shapes are proposed in the lit- 
erature. In the often cited article [2], shapes are decomposed into simple compo- 
nents, “geometrical ions”, geons. The geons consist, in that article, of generalized 
cones. The shape is segmented and the obtained parts approximated with the 
best fitting geon. This can, as mentioned in [2], be seen as analogous to the usage 
of phonemes in the lexical access during speech perception. For example, only 44 
phonemes are needed to code all the words in English. For 2D images, 36 geons 
are needed. 

In [10], shapes are decomposed by the usage of necks and limbs. A shape is 
divided into parts by part-lines, which are curves with endpoints on the boundary 
of the shape. Each part-line is entirely embedded in the shape and divides the 
shape into two connected components. In [II], the decompositions made from 
this algorithm are compared to decompositions made by a group of humans. 

Often skeletonization is used as a first step in the decomposition, [12,7,9]. 
In [12], the skeleton is segmented. Small parts of the shape consist of unions of 
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the maximal discs centered on points in the skeleton segment. These small parts 
are merged to “roughly convex” parts of the shape, thus introducing the final 
decomposition of the shape. In [7] the skeleton is segmented into approximately 
straight segments. The union of the parts associated to the skeleton segments 
is not guaranteed to cover the original shape, which, in some applications, is a 
disadvantage. Both the above papers are based on mathematical morphology. 
In [9], the skeleton is used, but not mathematical morphology. The skeleton 
is there seen as a 3D curve, where 3D coordinates are the planar coordinates 
and the distance label of the skeleton pixels, and is segmented into rectilinear 
segments. Simple regions can be associated to the segments, which creates a 
shape decomposition. 

The above algorithms are for 2D images, or 2D projections of a 3D scene. 
Algorithms for decomposing 3D shape also are of great interest, but are less fre- 
quently found in the literature. Some of the 2D algorithms previously mentioned 
are supposed to be straight forward to extend to the 3D case, such as [7]. This 
is, however, in general not easy. We will present a hierarchical decomposition 
algorithm for 3D (volume) digital shapes. The algorithm uses a multiresolution 
representation of the shape and is the 3D extension of an existing algorithm for 
the 2D case, [5]. In addition, a process for merging small components to more 
significant ones is introduced. As a side effect of the algorithm, we obtain a way 
of smoothing noisy shapes. 

The basic idea of the algorithm is to order components according to local 
thickness, which is a perceptually relevant criterion. Local thickness is “mea- 
sured” as persistence in a resolution pyramid built from the shape. Lower res- 
olution shapes are projected onto higher resolution levels, identifying more or 
less significant parts. At each level in the resolution pyramid, a decomposition 
is achieved. The decomposition of the original resolution shape is the most in- 
teresting one, which can then be used to analyze the shape. 



2 Definitions 

A volume image consists of voxels; a voxel has three types of neighbours among 
the 3x3x3 closest voxels: six sharing a face, twelve sharing an edge, and eight 
sharing a point with the voxel. From these three types of neighbours, three differ- 
ent connectivities can be defined: 6-connectivity, based on the face-neighbours, 
18-connectivity, based on the face- and edge-neighbours, and 26-connectivity, 
based on the face-, edge- and point-neighbours. In this paper we define a 3D 
shape as a 26-connected set of voxels and background as a 6-connected set of 
voxels. Both shape and background can consist of several components. 

A multiresolution representation of a 3D shape is the 2x2x2 binary pyra- 
mid. Each level of the pyramid is represented by a three-dimensional array, where 
voxels are either shape voxels or background voxels. The highest resolution level 
is the original image array. This level (as well as all other levels) is partitioned 
into 2x2x2 voxel blocks. For each succesive level one such block, the eight 
children, is represented by a single voxel, the parent. The pyramid ends when 
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the last level consists of a single voxel. Various rules can be used to determine 
if the parent voxel should be an object or a background voxel. 

A simple multiresolution structure is the AND-pyramid. The parent becomes 
a shape voxels iff all its eight children are object voxels, otherwise it becomes a 
background voxel. In the AND-pyramid, the object shrinks more than in pro- 
portion to the resolution, the object becomes disconnected, and will disappear 
at some level before the last one. An example of an AND-pyramid is shown in 
Figure 1. 




Fig. 1. An AND-pyramid of a PotPlant shape. 

Alternative ways to build binary pyramids, were the aim was to preserve 
object shape and topology as well as possible in lower resolutions, have been re- 
cently introduced, [6]. It’s exactly because of the capacity they have to faithfully 
represent object shape that these methods are not suited to originate object de- 
composition based on width information. On the contrary, the AND-pyramid is 
not good for representing object shape, but is well suited for our decomposition 
scheme. At each intermediate level of the AND-pyramid, only some of the regions 
into which the object can possibly be decomposed are actually present: those al- 
ready existing at lower resolutions and those appearing for the first time at that 
resolution. The latter regions can be used as seeds to originate decomposition at 
higher resolution. 

Every shape voxel can be labelled with the distance to the closest background 
voxel, using a suitable metric. Three simple metrics are D®, and where 
the distance from a voxel u to a voxel v is defined as the number of voxels in the 
minimal 6-, 18-, and 26-connected path between u and v, respectively. Distance 
labels are produced by computing the distance transform (DT) of the image. For 
the simple metrics we get I?T®, and , respectively. If distances more 

similar to digital Euclidean distance are needed, the so called weighted distance 
transforms (WDT) can be used. In the WDT, different weights are used to 
measure unit steps in various directions. Frequently used weights are 3, for the 
distance to a face-neighbour, 4, for the distance to an edge-neighbour, and 5, 
for the distance to a point-neighbour, [3]. This WDT is denoted the 3 — 4 — 5 
DT, and it is often a good enough approximation of the Euclidean distance. The 
digital Euclidean DT (EDT) can also be computed, [8]. 

The DT, using D®, D^®, or a WDT, of an image can easily be computed 

using a simple two-pass algorithm. Shape voxels are first given the value infinity. 
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Distances are then propagated in a forward and a backward pass. Each voxel is 
labelled with the minimum of the label of the current voxel and the labels of the 
already visited neighbours increased by the corresponding local distances to the 
current voxel. The EDT requires more passes, [8]. 

By an analogue two-pass sequential algorithm, the reverse distance transform 
(RDT) can be computed from a number of labelled seed voxels. Each voxel is 
assigned the maximum of the label of the current voxel and the label of the 
already visited neighbours decreased by the corresponding local distances to 
the current voxel. The algorithm for computing the reverse EDT is much more 
complicated, [4]. 

The distance label of any voxel in the distance transform can be interpreted 
as the radius of a sphere, in the chosen metric, centred on the voxel. (The sphere 
can be obtained by applying the RDT with the centre voxel as a seed voxel.) 
Per definition, this sphere is included in the object. A shape voxel is said to be 
a centre of a maximal sphere if the associated sphere is not completely covered 
by any other single sphere. The union of the maximal spheres coincides with the 
whole object. The centres of maximal spheres can be detected analogously to 
the 2D case, [1,4]. In the cases of D®, and they are simply the local 
maxima in the DT. 

3 Decomposing a digital shape 

The idea behind the decomposition is to use the intermediate levels of an AND- 
pyramid to determine local thickness, measured as persistence in the levels. 
There are several reasons for choosing the AND-pyramid: When resolution de- 
creases, noisy protrusions disappear, the shape decreases in size, and no regions 
are added. Thus, a part of the shape that “survives” at low resolution is thicker 
and more significant than a part that disappeares already in relatively high 
resolutions. By projecting essential points in the surviving parts onto the corre- 
sponding children/descendants at higher resolution, we get markers at each level 
that will be used as a basis for decomposition at that level. 

The binary image with the shape to be decomposed is stored in the bot- 
tom level, the first level, of a resolution pyramid. The shape is denoted Si and 
is assumed to not touch the frame, i. e. the planes a; = 0, x = maxx, y = 0, 
y = maxj/, z = 0 and 2 = maxz contain no shape voxels. If the image dimension 
is not a power of two, background planes can be added as necessary. 

From the first level an AND-pyramid is built. The successive shapes are 
denoted S 2 , S 3 , . . . , Si. The last level which still includes some shape voxels is 
called the last level and is denoted 1. After building the AND-pyramid, the DT 
for each shape. Si, is computed. Any metric can be used when computing the 
DT. We have made experiments with the D®, D^® and 3 — 4 — 5 metrics. 

3.1 Smoothing 

Each of the shapes Si is converted into a series of smoothed shapes, and these 
shapes will eventually be used to produce the decompsition at level i. First we 
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smooth the shape at level i, using the parent resolution level i+1, thus producing 
For each shape voxel in the parent level z + 1, the distance values of its 
2x2x2 children, found in DT of Si, are copied into an empty, level z image, 
DTi^i+i. The shape at level z + 1 is thus projected onto level z, but with voxels 
labelled as in the DT of the original shape Si. The RDT is applied to the DTi^i+i 
resulting in RDTi^i+i. Binarizing RDTi^i+i gives the smoothed shape Si^i+i. The 
process is illustrated in Figure 2. The levels S 2 and S 3 of the PotPlant in Figure 1 
are shown, together with and S'2,3 which is the result of smoothing level 

two using information from level three. (Voxels of 0 X 2^3 are labelled as the 
analogous voxels in the DT of S'2). 




Fig. 2. Computing a smoothed version of the PotPlant level two. See text for expla- 
nations. 

Generally, the shape is smoothed with respect to Si, as not all centres 

of maximal spheres of the DT of Si will be present in DTi ij^i. Especially centres 
of small, border spheres will be partially lost. The shape will not show the 
uneveness of the lower resolution enlarged, however, due to the RDT applied to 
the seeds indentified by the projection. 

Smoothing of Si is repeated eith the grandparent level z -h 2 as seed level so 
originating S'i^i+ 2 . Each shape voxel in the grandparent level z -I- 2 is projected 
onto its 2^ X 2^ X 2^ grandchildren and their values from the DT of Si are copied 
into an empty image, DTi^i_i_ 2 . After applying the RDT, RDTi^ij^ 2 , and binarizing 
the result, the smoothed shape Si^i +2 is obtained. 

This is repeated, using all ancestor levels, until the seed level reaches level 1. 
The result is a series of more and more smoothed shapes Si^k, i < k <1. 

3.2 Decomposition 

The smoothed shapes Si^k are used for the decomposition of Si in the following 
way. Let all shape voxels in Si be labelled z. If the last level is the fth level of the 
pyramid, assign the value I to all voxels in Si that also belong to Si^i. Continue 
by assigning value I — 1 to all voxels labelled z in Si that also belongs to Si^i-i. 
Proceed until all labels k, I > k > i, have been assigned. The labels account 
for the level of the hierarchy to which the voxels belong. The decomposition 
of level three of the PotPlant is shown in Figure 3. On the left is the original 
level, in the middle all voxels labelled 4 and on the right all voxels labelled 3. 

The shape Si has now been decomposed. Note that decomposition is obtained 
at all levels of the AND-pyramid, even though in many cases the decomposition 
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Fig. 3. Decomposition of the third level of the PotPlant (left) into two components, 
were voxels are labelled 3 (middle) and 4 (right). 



of Si is the most interesting one. However, many components in this first de- 
composition will either be very small or be shaped as thin slices of skin. Thus, 
a post-processing step is necessary, where these slices are merged to adjacent 
significant components. Such a merging step is described in Section 4. 

The AND-pyramid is, as any binary pyramid, very translation dependent. 
This implies that if the original shape is shifted only one step in any di- 
rection, the decomposition will become different. One approach to make the de- 
composition more translation invariant is to shift the shape a number of possible 
ways, make the relative decompositions, and then move the shape back to its 
original position. The combined decomposition is obtained by assigning to each 
voxel the maximum of the labels in the different decompositions of the shape 
in the various shifted positions. We denote the case when the decomposition is 
computed from the shape in its original position only ATVi^i-decomposition. By 
also shifting the shape one step in a;-direction, in j/-direction, and in ^-direction, 
we compute three further decompositions, that are combined, to originate the 
AA^Z? 4 -decomposition. An even more translation independent decomposition is 
the AA^ZJg-decomposition, where 8 decompositions are used corresponding to 
the 8 possible shiftings of a 2 x 2 x 2-block. 

The decomposition algorithm described in this Section can be summarized 
by the following pseudo-algorithm: 

from Si build AND-pyramid — > S' 2 , S 3 , . . . 
compute DT of S'!, S' 2 , . . . 

1 : =number of levels 
for k=l downto 2 
for i=l to k-1 

for each voxel G S^ 

set distance value of its children into DTi^k 
compute RDT of DTi^k RDTi^k 
binarize RDTi k Si^k 
for i=l to 1 

for each voxel G Si 

set the value i into Di 
for k=l downto k+1 

for each voxel G Si^k 

set the value j into Dk 
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4 Merging 

As mentioned in Section 3 the decomposition will often contain parts that are 
very small or shaped as thin slices of skin. These should be merged to adjacent 
more significant parts of the decomposition. A simple way of merging is to remove 
all small or skin-shaped parts and free their voxels for relabelling with the label 
of the adjacent component they merged to. 

In general, skin-shaped components are very thin and do not include internal 
voxels, so that one single shrinking step is enough to remove all of them. Shrink- 
ing can be done on all decomposition components simultaneously by setting to 
the background value 0 all border voxels, i. e. all shape voxels having at least 
one 6-neighbour not labelled with the same value as the voxels themselves. The 
region occupied by the removed voxels from the original decomposition should 
then be recovered by adjacent regions. To this aim, labels remaining after the 
deletion of thin components, are iteratively propagated into the deleted parts. 
In practice, at each each iteration all shape voxels in Si labelled zero are given 
the label of their largest labelled 6-neighbour. The most significant levels of the 
hierarchy are favoured here, since if a voxel is reached simultaneously by waves 
propagating from differently labelled kernels, the highest value is chosen. 

It may happen that no propagating labels remain in some subset of the shape 
consisting only of thin components. If that is the case, the voxels remaining 
unlabelled after the propagation are given the largest value of their children (or 
grandchildren) . 

To cause merging of thicker components, we can define a border voxel as 
a voxel having at least one 18-, or even one 26-neighbour not labelled with the 
same value as the voxel itself. Alternatively, more than one shrinking step can be 
performed. This is equivalent to setting to zero voxels in the DT with label not 
greater than the number of desired shrinking steps. If several shrinking steps are 
used, a WDT can ensure direction independent shrinking. In this case, shrinking 
is the same as binarizing the WDT of the shape at a certain distance value. 

This merging algorithm is easy to implement and performs fairly well. It has, 
however, the serious drawback that elongated thin parts in the decomposition 
are likely to be shortened as their peripheral voxels will be absorbed by adjacent 
more significant components. This would be the case, for example, for the thin 
elongated bar connecting the two weights in a dumbbell. This happens especially 
often when we use more than one shrinking step. 

A more sophisticated merging algorithm investigates whether a component is 
small or not and then also considers if most of its border voxels are neighbours of 
a single adjacent component. If this is the case, the component should be merged 
to the adjacent component. The components are investigated in increasing level- 
order. It is here important to keep in mind that there will be a lot of small 
components, especially for the first levels. 

To decide whether a component, C, is small or not, the DT of the component 
is computed. A component is defined as small if all voxels in the component have 
distance values less than or equal to a chosen maximal label, M. Other definitions 
of “small component” are possible, but we have found this simple one to work 
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well in most cases. If C is found to be small, the adjacent components belonging 
to a higher level are investigated. Merging C to the component C is done if C 
is the component adjacent to C with the highest ratio, Km, between the number 
of voxels of C having a neighbour in C and the total number of voxels of C, 
and if Km is higher than a chosen value. 

This more sophisticated merging algorithm generally performs better than 
the simple one. The only case where it does not perform well is when a lot of 
small and skin-shaped parts are connected to a “more significant part” at the 
same level. An example of this will be shown in Section 5 (Figure 5). This fact 
suggests that both merging processes should actually be used. The simple one 
to remove noisy skins, the second process to reduce the number of components 
in a more natural way. 



5 Results and experiments 

Some examples of shape decompositions are shown in Figures 4-7. The highest 
resolution level is 2® x 2® x 2® in every example, i. e. the images are 64 x 64 x 64 
voxels. The shapes are chosen to show some special cases. In Figure 4, a decom- 
position of the PotPlant-shape from Figure I is shown, both before and after 
merging of small components. Figure 5 and 6 show two different decompositions 
of a Dog-shape. The Dog and the PotPlant are made by joining different geomet- 
ric shapes, such as cones, cylinders, and spheres. The Horse-shape in Figure 7 is 
built from a real 2D image of a Horse. The skeleton of the 2D Horse is computed 
and then inserted into a volume. Thereafter the RDT is computed and the result 
is binarized to get the synthetic 3D Horse. 

In all these examples, the 3 — 4 — 5 metric is used when computing the 
smoothed shapes Si^k- This gives a bit more rounded parts in the decompo- 
sition than the simpler D®, and D^®. The AIYDg-decomposition is used in 
all cases. In Figures 4, 5 and 7, only the more sophisticated merging algorithm 
has been used, with different values for the maximal label M, the adjacency 
ratio Km, and different shrinking strategies. In Figure 6, one step of the simple 
merging algorithm was performed, before the final merging. 

The metric and the constants M and Km used in the merging process may 
vary depending on the type of shapes to be decomposed. Different choices can 
lead to the same decomposition. For example, the same result for the decompo- 
sition of the PotPlant in Figure 4 can be obtained by choosing the metric D®, 
M = 3 and Km = 0.5. Km should be set to about 0.5 to ensure that the com- 
ponent to be merged is skin-shaped, or small, because then at least one out of 
two voxels have to have a neighbour in the adjacent component. In Figures 5, 6 
and 7, the metric D® together with M = 1 are chosen as merging parameters. 
This implies that only components whose voxel all having a 6-neighbour in the 
background or in an adjacent component (i. e. all voxel in the component have 
distance value 1 in the DT of the component) should be merged. 

From Figures 1 and 4 it is easy to follow the idea of our decomposition 
algorithm. We can see that, in the AND-pyramid, the shape in level 2 is very 
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Fig. 4. Decomposition of a PotPlant. On the left is the original decomposition, on the 
right the decomposition after merging. The 3 — 4 — 5 metric is used for smoothing. The 
metric is used together with M — 2 and Km = 0.5, for merging. 



similar to the shape in level 1. For the shape in level 3, stem and petals have 
vanished, and in level 4 the only thing left is the pot. This corresponds to that the 
pot should belong to level 4, the calyx should belong to level 3, and the stem and 
the petals to level 2, which is exactly the decomposition shown in Figure 4 right. 

The only example where a part belonging to level 1 still exists in the de- 
composition after merging is the Horse in Figure 7. When using the AND^- 
decomposition, it is often the case that the smoothed shape S' 1^2 fully describes 
Si except for a few scattered voxels. These remaining voxels are merged to other 
components. In the Horse case, one of the legs consists of 2 x 8 x 1 voxels. Since 
in the AND-pyramid only components larger than 2x2x2 voxels are preserved 
at the immediately lower resolution, it is not possible to preserve the leg of the 
Horse to level 2, in any position of the shape. 

In Figure 5, a situation which does not happen very often is shown. Many 
small and skin-shaped parts appear at level 3. These would be expected to be 
merged into level 4, the body of the Dog. However, these parts form a single 
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Fig. 5. Decomposition of a Dog. The 3 — 4 — 5 metric is used for smoothing. The D® 
metric is used together with M — 1 and Km = 0.5, for merging. 
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Fig. 6. Decomposition of a Dog. The 3 — 4 — 5 metric is used for smoothing. One step 
with the simple merging algorithm is taken using 6-connectivity for the definition of 
border voxels. For the final merging the metric is used together with M = 1 and 
Km = 0.5. 



connected component with the head of the Dog, since we use 26-connectivity for 
the shape. This component can not be considered as small. When this happens, 
the simple merging algorithm is of great help. It removes most of the small and 
skin-shaped parts at level 3. The ones remaining are no longer connected to the 
head of the Dog and are then easily removed by the more sophisticated merging 
process. The result is shown in Figure 6. It is possible to see, on a close-up, that 
legs and tail are a bit shorter than in Figure 5. This unwanted effect is due to 
the use of the simple merging. 
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Level 1 




Level 4 



Fig. 7. Decomposition of a Horse. The 3 — 4 — 5 metric is used for smoothing. The 
_D® metric is used together with M — 1 and Km = 0.5, for merging. 



Through experiments we have seen that the AND4- and AA^ZJg-decomp- 
ositions give more stable results than the >liVDi-decomposition. Even though 
these are more computationally heavy than the AA^Z?i-decomposition, their 
use is in most cases necessary. The difference between the AND 4- and ANDs- 
decomposition is not equally large and the choice can be made from the compu- 
tational power available and the stability needed. 

Even though the AA^Dg-decomposition is selected, the decomposition is still 
not fully translation invariant. Examples of this can be seen in Figure 5 and 6. 
Here, depending on the position of the shape when making the AND-pyramids, 
two of the legs of the Dog belong to level 2 and the other two belong to level 3. 
To make the decomposition totally translation independent the shape should 
be decomposed in all positions corresponding to the possible shiftings in a cube 
with sides of the size corresponding to one voxel in the last level Z, i. e. if Z = 4 
the cube would be 2*“^ x 2*“^ x 2*“^ =2^ x 2^ x 2^ voxels and thus 64 different 
positions. This would of course be computationally very heavy. However, the 
AA^Dg-variant gives a good approximation. 

When implemented (without optimization) on a standard workstation com- 
puting the decompositions in Figure 4-7 takes about 25 seconds each, when 
using the AA^Dg-decomposition and no merging. 
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6 Conclusion 

We have presented an algorithm based on a multiresolution representation of 
a three-dimensional digital shape that decomposes a shape into different parts 
depending on local thickness. Small and skin-shaped parts are merged into more 
significant parts by a merging process. A fairly translation independent decom- 
position is obtained, by shifting the shape in different directions and by suitably 
combining the various decompositions. 



Acknowledgements 

Many thanks to Dr. Ingela Nystrom for proofreading and advice. 



References 

1. C. Arcelli and G. Sanniti di Baja. Finding local maxima in a pseudo-Euclidean 
distance transform. Computer Vision, Graphics and Imaqe Processinq, 43(3):361- 
367, Sept. 1988. 

2. I. Biederman. Recognition-by-components: A theory of human image understand- 
ing. Psychological Review, 94(2):115-147, 1987. 

3. G. Borgefors. On digital distance transform in three dimensions. Computer Vision 
and Image Understanding, 64(3):368-376, Nov. 1996. 

4. G. Borgefors and I. Nystrom. Efficient shape representation by minimizing the 
set of centres of maximal discs/spheres. Pattern Recognition Letters, 18:465-472, 
1997. 

5. G. Borgefors and G. Sanniti di Baja. Parallel smoothing and decomposition of 
digital shapes using a multiresolution structure. In Proceedings 10th International 
Conference on Pattern Recognition, pages 745-748. IEEE Gomputer Society Press, 
1990. 

6. G. Borgefors, G. Sanniti di Baja, and S. Svensson. Multiresolution representation 
of shape in binary images ii: Volume images. In E. Ahronovitz and C. Fiorio, edi- 
tors, Discrete Geometry for Computer Imagery (DGCP97), pages 75-86. Springer 
Verlag, Berlin Heidelberg 1997. Lecture Notes in Gomputer Science 1347. 

7. A. Held and K. Abe. On decomposition of binary shapes into meaningful parts. 
Pattern Recognition, 27(5):637-647, 1994. 

8. I. Ragnemalm. The Euclidean distance transform in arbitrary dimensions. Pattern 
Recognition Letters, 14(ll):883-888, Nov. 1993. 

9. G. Sanniti di Baja and E. Thiel. (3,4)-weighted skeleton decomposition for pattern 
representation and description. Pattern Recognition, 27(8): 1039-1049, 1994. 

10. K. Siddiqi and B. B. Kimia. Parts of visual form: Computational aspects. IEEE 
Transactions on Pattern Analysis and Machine Intelligence, 17(3):239-251, Mar. 
1995. 

11. K. Siddiqi, K. J. Tresness, and B. B. Kimia. On the anatomy of visual form. In 
C. Arcelli, L. P. Cordelia, and G. Sanniti di Baja, editors, Aspects of Visual Form 
Processing, pages 507-521. World Scientific Publishing Co. Pte. Ltd., 1994. 

12. J. Xu. Morphological decomposition of 2-D binary shapes into simpler shape parts. 
Pattern Recognition Letters, 17(7):759-769, 1996. 




Optimal Time Computation of the Tangent of a 

Discrete Curve: 

Application to the Curvature 



Fabien Feschet and Laure Tougne 
{f f eschet , ltougne}@univ-lyon2 . f r 



Laboratoire E.R.I.C. 
Universite Lyon 2 
5 av. Pierre Mendes-France 
69676 Bron cedex, France 



Abstract. With the definition of discrete lines introduced by Reveilles 
[REV91], there has been a wide range of research in discrete geometry 
and more precisely on the study of discrete lines. By the use of the lin- 
ear time segment recognition algorithm of Debled and Reveilles [DR94] , 
Vialard [VIA96a] has proposed a 0{l) algorithm for computing the tan- 
gent in one point of a discrete curve where I is the average length of 
the tangent. By applying her algorithm to n points of a discrete curve, 
the complexity becomes 0{n.l). This paper proposes a new approach 
for computing the tangent. It is based on a precise study of the tangent 
evolution along a discrete curve. The resulting algorithm has a 0{n) com- 
plexity and is thus optimal. Some applications in curvature computation 
and a tombstones contours study are also presented. 



Keywords: discrete tangent, discrete curve, tombstones contours study. 

1 Introduction 

The work we present in this paper is inserted in a project concerning extraction 
of geometrical characteristics from tombstones contours. As we have to treat 
thousands of such figures, our goal is to find an automatic process to extract 
these characteristics. The most used properties of discrete curves are tangent 
orientation [SD91] and the curvature value in each point [VIA96b], [WS95]. 

In this paper, we present an optimal time algorithm for computing the char- 
acteristics of the discrete tangent in all the points of a 8-connected discrete curve. 
Although this algorithm is available for all the curves (open or closed) which do 
not self-intersect, in order to simplify the explanations we choose to present it 
in the case of open curves. 

In section 2, we remember Vialard’s algorithm [VIA96b] to compute a dis- 
crete tangent in one point of a discrete curve. This algorithm is based on De- 
bled’s algorithm [DR95] used to recognize discrete segments. The complexity of 
Vialard’s algorithm is 0(n.l) where n is the number of points of the curve and I 
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the average length of the tangent. The section 3 describes our optimal method 
to compute the tangent in all the points of the curve. We give the pseudo-code of 
the algorithm and prove its complexity. In section 4, we explain how to deduce 
the curvature and provide an application. 



2 Vialard’s algorithm 

In [VIA96b] and [VIA96a], A. Vialard describes a linear time algorithm to com- 
pute the curvature in one point of a discrete curve. This algorithm proceeds in 
two steps: first, it computes the discrete tangent associated to the current point 
and second, it uses this approximation of the tangent to determine the curva- 
ture at this point. In the following, we describe the first step with precision. The 
second one is described in section 4. 

The notion of discrete tangent is based upon the arithmetical line defined as 
follows. 

Definition 1 ([REV91]). 

A set of points T> (Z is an arithmetical line if and only if there exists four 
integers a, b, pi, to such that: VM = (x,y) G T>, pi< ax — by</j, + uj where | 
represents the slope of the line, pi its position in the plane and oj its thickness. 

In the following, we only consider the case where co is equal to sup(| a |, | 6 |). 
It corresponds to the usual 8-connected lines. In this paper, a curve is a set 
of points {co,...,c„} (with n a natural number) such that the points c, and Cj 
{0 < i < n, 0 < j < n) are 8-adjacent if and only if \j — i\ = 1 ([eAM91]). 

A discrete segment is a bounded and connected subset of an arithmetical 
line. A discrete tangent in a point P of a curve is a subset of points surrounding 
P which is a discrete segment. To test if a set of points is a segment, A. Vialard 
uses the segment recognition algorithm of Debled and Reveilles [DR94] . For a 
given set of points, their algorithm finds the longest subset starting on the first 
point, which forms a discrete segment. Moreover, it returns the characteristics 
a, b and pi of the segment. The complexity of this method is 0{l) where I is the 
number of points of the recognized segment. 

There exist many definitions of the discrete tangent in a point P = Ci (0 < 
t < n) of a curve C = {cq, . . . , c„}. Among those definitions, we can consider the 
following one. 

Definition 2. The tangent at P = Ci is a segment {ci_fc, . . . , Cj, . . . , c^+fc} such 
that {ci-k-i, ■ ■ ■ ,Ci+fc+i} is not a segment (with k a natural number). 

It consists in adding pairs of points of the curve symmetrically around P until 
the resulting set is not a segment. 

We can also try to continue the process only at one side (first on the right and 
second on the left) of the curve. Hence, the tangent we obtain is defined as 
follows. 
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Definition 3. The tangent at P = Ci is a segment {ci_fc, . . . , Cj, . . . , Ci+k+p\ 
(resp. {c^-k-q, . .. ,Ci,. .. ,Ci+k}) such that neither . . . ,Ci+fc+i} nor 

{ci_fc, . . . , Ci+k+p+i} (resp. {ci-k-q-i, • ■ • , Ci+k}) is a segment (with k and p two 
natural numbers). 

In definition 2, the tangent is independent of the direction of travelling along 
the curve and is symmetric around P. However, the tangents we obtain are 
smaller than the ones we have with the definition 3 and so, are less precise. A. 
Vialard considers the second one as we do in this article. 

So, we obtain the discrete tangent in P and its characteristics a, b, p,. The 
complexity of this algorithm is 0{l) where I is the length (in number of pixels) 
of the tangent. As Vialard noted in [VIA96a] the execution time can be reduced 
by modifying the algorithm of Debled and Reveilles such that adding a point on 
the left is similar to adding a point on the right, thus consisting only in updates 
of the a, b and p parameters. Debled’s algorithm computes the characteristics 
of the segment with the leftmost point being the origin. However, in Vialard’s 
algorithm, the parameters of the tangent are computed with P being the origin. 
Changing the origin is done in 0(1) by modifying the p parameter. 

Using Vialard’s algorithm for each point of the curve, the global complexity 
of the tangent computation is 0(/.n) where I is the average length of the discrete 
tangent. When computing the tangent for all the points of a discrete segment, 
the complexity of the previous algorithm becomes O(n^) with n the length of 
the segment. This can be considered as a drawback of the algorithm. Intuitively, 
we expect to do the computation only on the first point and then detect all the 
points of the discrete segment that have a tangent with same characteristics. 
This is the case of the algorithm we present in the following, which complexity 
is 0{n) where n is the length of the curve. 



3 A new algorithm for tangent computation 

If we consider a discrete segment, the tangent is constant for all the points. 
So the key idea is to determine subsets of points having a constant tangent. 
More precisely, we look for extremal points of pieces of the curve with constant 
tangent. We only compute the characteristics of the tangent in these points, 
thus reducing the complexity of the tangent determination. In the following we 
describe the method with a complete example, prove its correctness and study 
its complexity. 



3.1 The method 

Let us consider an open discrete curve and a point P on this curve. By applying 
Vialard’s algorithm in P, we compute the discrete tangent (see figure 1). We 
denote by R the first point we reject on the right. So, the tangent in R and in 
P cannot be the same ones. Moreover even if [P, i?[ is a segment, the tangent in 
each of the points of this segment may have not the same characteristics. Now 
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ESI point of the tangent 
I first rejected point 
I other point of the curve 



Fig. l.The tangent in P obtained by applying Vialard’s algorithm {R the first rejected 
point) 



we look for the first point M between P and R for which the tangent has not 
the same characteristics as the one in P. 

In figure 1, if we consider the right neighbor of P called P' and if we apply 
again Vialard’s algorithm, we will find the same characteristics for the tangent. 
This comes from the fact that, in the figure, the point R can not be added to the 
tangent in P' . In fact, the tangent parameters remain constant for all the points 
for which the tangent does not contain R. M is the first point the tangent of 
which contains R {{M , . . . , i?} is a segment). In order to determine the position 
of M, we test all the points on the left of R to find the subset of points which 
belong to the discrete segment starting at R (see figure 2). To do this, we make a 
backward read of the curve from R and we call L the first rejected point, which 
can be either on the left or on the right of P. Since the set of points {L , . . . , i?} is 




point of the recognized line 
first rejected point 
other point of the curve 



Fig. 2. Recognized line from R by applying Debled’s algorithm {L the first rejected 
point) 



not a segment but {M , . . . , i?} is, M is a point of ]L, R] . The goal of the following 
paragraphs is to prove that M is the middle of ]T, i?]. 

Let us now consider the computation of the tangent in M. In a first step, 
we add pairs of points symmetrically around M . Then three cases can occur 
according to the position of M in ]L, i?] : 

— L appears strictly before i? in a pair. That is to say d{L, M) < d(M, R). Then 
this pair is accepted because in this case the current computed tangent in M 
is included in the one in P. Hence, since the set {L, . . . ,R} is not a segment, 
R does not belong to the tangent in M. So, the tangent in M has the same 
characteristics as the one in P. This is a contradiction with the definition of 
M. In conclusion, this case can not occur. 

— R appears strictly before L in a pair. That is to say d{L,M) > d{M,R). 
Then this pair is accepted because the current computed tangent in M is 
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included in the backward segment starting at R. Hence, L does not belong 
to the tangent in M which is thus different of the one in P. 

— The pair {L,R) is considered. That is to say d{L,M) = d{M,R). This pair 
is rejected. We try to add points only on one side. As we begin by the right, 
R is added to the current computed tangent in M. Hence, L does not belong 
to the tangent in M which is thus different of the one in P. 



As a consequence, the characteristics of the tangent are different from the one 
in P for all the points closer to R than to L. That is to say for all the points Pi 
that satisfy d{L, Pi) > d{Pi, R). By definition, the point M is the first of these 
points. So, it is the ’’middle of ]L, i?]”. It is defined by d{L,M) = (ggg 

figure 3). 

The characteristics of the tangent in all the points between P and M (not 
included) are constant. Let us note that to keep on computing the tangent in M, 
we can use the characteristics computed for the ]L,R] segment and thus, only 
consider points on the right of R. 

To determine the tangent for all the points of the curve, we use the previous 
remarks. Let us notice that we do not prove yet that M is at the right of P. It is 
very important for the complexity of the algorithm we propose in the following 
section. 

To prove this fact, we have to consider the following three cases. First, if L is 
strictly at the right of P, so M is by definition. Second, L is strictly at the left 
of P. So, P belongs to the segment ]L,R\. But since R does not belong to the 
tangent at P, L appears strictly before i? in a pair while building the tangent 
in P. So, by definition of M, P is strictly at the left of M. Third, if L is equal 
to P then, since [L, R] contains at least three points, M is strictly at the right 
of L = P. 



M R 

jl 



Fig. 3 • Determination of the point M: the middle of the segment ]L,R] (the first point 
the tangent of which contains R) 
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3.2 Pseudo-code 



Algorithm 1.1 Compute-tangent (curve C) 



1 while P belongs to C do 

2 _R=Tangent(P,a,fe,/i) 

3 if i? exists then do 

4 L=Reverse_DR(i?) 

5 M=Middle(P,L) 

6 for every point I between P and M (not included) do 

7 the characteristics of I are a, b, jj,' 

8 else 

9 all points from P to the end of the curve have 

10 the characteristics a, b, fi' 

11 M = 0 

12 end if 

13 P = M 

14 end while 



Let us now give some explanations of the previous pseudo-code. In step 2, 
we compute the tangent in P and, obtain the first rejected point R and the 
characteristics a, b and R exists if we are not at the end of the curve. 

The step 4 consists in recognizing the longest discrete segment from R. It 
returns the point L which is the first excluded point. In step 5, the middle M of 
the segment ]L, i?] is determined. In lines 6 and 7, we affect the characteristics a, 
b and fi' to the points located between P and M (not included), /i' is computed 
from fj, by centering the tangent on the current point [VIA96a] . With lines 9 and 
10, we simply copy the characteristics a, b and to all the points at the right 
of P. At the end, in line 13, we move to M the next point to be considered. 

3.3 Efficient implementation and complexity 

In this section, we propose an efficient implementation of the previous pseudo- 
code by modifying the step 4. In fact, we show that the ReverseJDR function 
call can be avoided by incremental computation of L. 

Let us consider an iteration i of the algorithm in a point Pi as drawn in figure 
4 (a). At this stage, the points Li, Ri, Mi and Gi (which limits the tangent in 
Pi on the left) are known. Let us study the way we compute the next point 
Ri+i and how to determine the next point without using the ReverseJDR 
function. 

In figure 4 (b), the point Mi becomes the point Pi+\. By computing the 
tangent in this point with the previous algorithm, we obtain the points Ri+\ 
and Gi+\. Let us notice that in fact, G^+i is the point Li. Indeed, since Ri and 
Li are incompatible and since Ri belongs to the tangent in Pi+i, Gi+\ is on 
the right of Li. But Li is the rightmost point of the curve not belonging to the 
discrete segment issued from Ri. Thus Li and Gi+i are the same points. As a 
consequence, is on the right of G^+i = Li. 
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(a) 



(b) 



(c) 



Gi 




Piti 

M 



iJ: 




Fig. 4. Two successive computations of the tangent: the points Li+i and Ri+i are 
obtained from the points Li, Ri, Mi and Gi 



Let us now determine the point (see figure 4 (c)). By definition, is 
the point that belongs to the tangent in and which is the rightmost one not 
belonging to the discrete segment issued from Ri+i. So, Li+i is the first point 
such that the set of points on its right, which belong to the tangent in Pi+i, plus 
Ri+i is a discrete segment. Consequently, we can proceed by iteratively deleting 
the leftmost point of the tangent in and testing if the resulting set of points 
plus Ri+i is a segment. 

Let us study the complexity of this process. Let us consider a configuration in 
which we have the points Mi, Li and Ri (see case (2) of figure 5). First, adding 
a point on the right of Ri and checking if the resulting set is still a discrete 
segment (case (3)) has a complexity 0(1) using Debled’s algorithm. Moreover, 
excluding a point on the left and testing if with the point i?i+i we always have 
a segment (case (4)) can also be done in 0(1). As a matter of fact, we have the 
following property ([VIA96a]). 



Gi 

( 1 ) • 
( 2 ) 

(3) 

(4) 



Pi Ri 

• • 



Li ; Mi Rj 

• ^ • • 




Li+1 ' Mi+1 

• ^ • 



Ri+ 



Ri. 



Fig. 5. Hashed rectangles cover the sets of points used during the updates of R and L 



Property 1. Let A(a, b, p,, 1) be a discrete segment. With no loss of generality, we 
suppose that we work in the first octant. Following Debled’s notation [DR95], 



38 



Fabien Feschet and Laure Tougne 



Iq and Sq the principal lower and upper leaning points are also known. The 
characteristics {a,b,fi) of the segment S\ (0,0) are obtained as follows in 0(1): 

Algorithm 1.2 

a' = —a, b' = —b, fi' = fi + a* xm — b * ijm 

Xtemp ~ ^Sq 1 ytemp VSq 

xso = yso = -yio + yM 

^Iq — ^temp H” ^ M "I yiQ — ytemp yM 



if ((6' = Xm' - Xs'^) and (o' = j/m' - J/s' )) then 
a = Vm' -«//'-!,& = Xm' - 
A = (j/M' - t/i' - 1)»M' - {x'm - Xj^)vmi + 1 
®/o = yfo = - 1 

XSq ~ Xs'g 1 VSo ~ ys'g 

else 

if {{b' = Xm’ - */') and (o' = yM' ~ J//' )) then 
a^VM' - Vs;, - 1, 6 = Xm> - Xs'^ 

A = iVM' - Vs'g - l)a:M' - {x'm ~ Xs'^)yM' - b 
®So = *s' - b, Vso = 1/s' - a - 1 
yi'o = yi'o 

else 

a = a' , b — b' 

A = d' 

®s"o = ®s' , Vso = ysj 

end if 
end if 

Consequently, the treatments of each point considered in the stages (2) and 
(4) have a complexity in 0(1). Just notice, as shown in figure 5, that each 
point of the curve is used at most three times (including the one for storing 
the characteristics of the tangent between P and M). Hence, the complexity for 
computing the tangent in all the points of the curve is 0{n). 



4 Some results 



We use the previous algorithm to compute the curvature at each point of a 
discrete curve. A review of curvature estimation is given by Worring and Smeul- 
ders in [WS93]. They showed that a good curvature estimator can be obtained 
by a Gaussian filtering applied to the tangent orientations. More precisely, the 
curvature k, can be estimated by 



Definition 4. 



k{P) 



1.1107 



with 9{P) being the angle between the tangent in P and the horizontal and with 



GA^) 



1 —i'^ \ 
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The coefficient 1.1107 represents the average distance between two points of 
the discrete curve and the window size of 6cr limits the influence of the Gaussian 
function. 

We are involved in a project whose goal is to use classification technics and 
induction graphs technics to extract significant information about tombstones. 
These tombstones have been drawn on sheets of paper by following their con- 
tours with a pencil. Then, the resulting curves have been scanned. Moreover, 
we also have non geometrical characteristics such as the place where the tomb- 
stone has been found, its approximative age, its style, and so on. The goal of the 
project is to try to find a link between the geometrical and the non geometrical 
parameters. The geometry of the tombstones is represented by a set of dominant 
patterns. At the end, if the link exists, it would be possible to obtain an auto- 
matic classification of the tombstones as well as other information such as the 
movement of the population through the study of the style of the tombstones. 
The first part of the project concerns the computation of the dominant patterns. 
In order to obtain such description, we define the characteristic points as those 
with high curvature. To extract these points, we first proceed by smoothing and 
thinning the images so as to obtain strictly 8-connected curves. We apply the 




(a) (b) (c) 



Fig. 6 .(a) the tombstones contour ; (b) curvature graphs for a tombstone contour ; (c) 
high curvature points are marked with little circles 



curvature computation on these curves. However, we have more than a thousands 
of tombstones contours to study and each curves is composed of two thousand or 
more points. Hence, we need a fast algorithm for computing the curvature. This 
is achieved by the linear time algorithm presented in this paper. At the end, we 
extract points using Teh and Chin’s algorithm [TC89]. An example of curvature 
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computation and high curvature points on a tombstone contour is presented in 
figure 6. 

5 Conclusion 

We have presented in this article a new algorithm for the tangent computation 
in all the points of a discrete open curve. We also have applied this algorithm 
to the curvature computation on tombstones contours. Our algorithm has a 
complexity in 0{n) where n is the number of points of the curve and is thus 
optimal. Moreover, it is easily extensible to the case of closed curves which still 
do not self-intersect. This is achieved by only modifying the first step of the 
algorithm. More precisely, we apply once Vialard’s algorithm for the first point 
and then use our algorithm. The complexity remains the same one. 

In future works, we would like to study what happens with not strictly 8- 
connected curves or in a more general case, with thick lines. 
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Abstract. The moment of (p, <7)-order, rrip^qiC), of a circle C given by 

{x — a)^ + (y — b)^ < is defined to be ff x^y'^dxdy. It is naturally to 

c 

assume that the discrete moments dmp^qiC), defined as 
dmp,q{C) = ^ f , 

i,j are integers 

(i-a)2 + (j-b)2<r2 

can be a good approximation for mp^qiC). This paper gives an answer 
what is the order of magnitude for the difference between a real mo- 
ment mp^q{C) and its approximation dmp^q(C), calculated from the cor- 
responding digital picture. Namely, we estimate 

mp,q{C) — dmp^q{C) = / / x^y'^dxdy — j’’^ 

^ i,j are integers 

(i-a)2+0-!,)2<,.2 

in function of the size of the considered circle C and its center position if 
p and q are assumed to be integers. These differences are upper bounded 

with O , where e is an arbitrary small positive number. 

The established upper bound can be understood as very sharp. 

The result has a practical importance, especially in the area of image 
processing and pattern recognition, because it shows what the picture 
resolution should be used in order to obtain a required precision in the 
parameter estimation from the digital data taken from the corresponded 
binary picture.^ 



Key words — Digital geometry, discrete shapes, parameter estimation. 



1 Introduction 

In this paper asymptotic expressions for the discrete moments of the circles in 
the xy-plane will be derived. The basic motivation for this paper comes from 

^ Research supported by Mathematical Institute, SANU, Beograd, under the project 
04M02 



G. Bertrand, M. Couprie, L. Perroton (Eds.): DGCI’99, LNCS 1568, pp. 41—49, 1999. 
(c) Springer- Verlag Berlin Heidelberg 1999 
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the area of image processing and pattern recognition. Namely, the mostly used 
mathematical model for computer picture (binary picture) of a considered shape 
is the integer grid whose points (so called digital points) take value 1 for points 
which belong to the given shape and 0 otherwise. 

That is a reason that we can assume (through the paper) that all appearing 
digital (lattice) points have positive coordinates - i.e. the origin is placed into the 
left-lower corner of the observed digital picture. Since our attention is focused 
on the binary pictures of the circles: {x — a)“^ + {y — b)^ < r^, the inequalities 
r < a and r < b will be assumed, if not mentioned. 

Definition 1. The moment of {p,q)~ order, mp^q{S), of a planar shape S is de- 
fined (as usual) to be 

mp^q{S) = J J xPy'^dxdy . 
s 

Remark. The moment mo,o(<5') is the area of a planar shape S and will be 
denoted as P{S). 

Definition 2. The discrete moments dmp^q(S) are defined as 
dmp^q{S) = ^ i^f . 

i,j are integers 

(i.nes 

Definition 3. The three-dimensional (0, 0, 0) -discrete moment, dmo,o,o(-B)> of 
a 3D body B expresses the number of digital points belonging to B, i.e. 

dmopp{B) = ^ 1 . 

i,i,k are integers 

The basic mathematical tool for our derivation is the following result from 
number theory ([!]). 

Theorem 1. If B is a convex body in B? , with boundary and positive cur- 
vature at every point of the boundary, then the number of lattice (i.e. digital) 
points belonging to r ■ B is: 

R{r ■ B) = r^ ■ P{B) + O (rri . (logr)^^ , (1) 

where P{B) denotes the area of B, while r ■ B is the dilatation of B by factor r. 
What we will use is weaker: 

R{r ■ B) = r“^ ■ P{B) -\- O for every e > 0. (2) 

The asymptotical expression for dmQp{C) (where C is a circle of the radius 
r) is a direct consequence of (2): 

E 1 = 

i,j are integers 

(i-a)2 + (j-l,)2<r2 



dmo,o{C) 



7T • -I- O (r 1^1 . (3) 
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2 Case p ■ q = 0 

In thsi section we estimate the difference mp^q{C) — dmp^q{C) in the case when 
one of the integers p and q is equal to zero. We need the following lemma. 

Lemma 1. Let the circle C he given by (x — a)^ + (y — b)^ < r^, and Ci{i) and 
C 2 {i) be determined by 

Ci(t) = I (a;,?/) : {x — a)^ + {y — b)^ < r^ and a; < i | 

and 

C' 2 (*) = |(a;,y) : {x — a)^ + {y — hf' < r"^ and a; > t | , 

where i is an integer satisfying a — r < i < a + r. Then, the following 
(equivalent) relations 

dmo.o(Ci(z)) = P(Ci(z)) + + (* - a)2 + o 

and 

c?mo.o(C2(z)) = P(C2(i)) + ^Jr^^W^^+O 
are satisfied. 

Proof. The conditions of Theorem 1 can be relaxed to allow B having a finite 
number of corners. So (1) can be applied to the intersection of the interiors of 
two convex curves or, by subtraction, to the region within one convex curve and 
outside another (for details, see [1]). Let us mention here that the boundedness 
of the radius of curvature is essential; if a long section of the curve is straight, 
the error term can be as large as the length of the “straight” section. 

Let us consider the circle C, given by {x — {2i — a)) + {y — b)'^ < {C 

is the circle symmetric to C with respect to the line x = i). The convex set 
C n C satisfies the conditions of Theorem 1. So, the number of digital points 
belonging to CD C can be determined as dmo,o(C'n (7) = P{C DC) +0(rn+=). 
The statement follows since the set C n C is symmetrical with respect to the 
line X = i. Obviously, the term + (z — a)^ expresses a half of the number 
of digital points, inside of the circle C, lying on the line x = i, with the error 
bounded by 2. QED 

Now, the asymptotical expressions for the discrete moments dmpfi{C) and 
dmo^q{C) can be given. 

Theorem 2. Let a, b and r be real numbers satisfying r < a and r < b. Then 
the following asymptotical expressions hold: 

dmpfi{C) = ^ ~ ^ > 

i,j are integers ^ 

(i-a)2 + (j-l,)2<r2 O 

dmo,q{C) = ^ j‘^ = ( ( y'^dxdy + O [b'^ ■ 

i,j are integers 

(i-a)2 + (j-l,)2<r2 O 
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Proof. Let’s notice that dmpfi{C) is equal to the number of digital points be- 
longing to the 3 D body 



Q=^{x,y,z) I {x-af' + {y-hf <r'^, Q < z < x^^ = Q' \J Q” , 
where 

Q'=^{x,y,z) I (a; - a)^ -I- (y - 6)^ < (a - r)^ < 2 < a;P| 

and 

g" = ^(x,y,z) I (a; - a)^ -I- (y - &)^ < r^, 0 < z < (a - r)^’| . 

First, consider the number of digital points belonging to the body Q' . If Wi, for 
i = [a — r] , [a — r] -I- 1, . . . , [a -I- rj — 1, denotes the body 

Wi = |(x, y, z) \ {x — a)^ + (y — b)^ < r'^ , x > i, < z < min{x^, {i + , 

then for the volume of Q' we have 

[o-l-rj -1 

vol{g') = ^ uo/(Wi) -I- (([a — r])*’ — (a — r)^) • • 7T -I- • Vr) . 

i=\a—r~\ 



Also, 

vol{Wi) = ((i + 1)P - iP) ■ P{C 2 {i)) - vol{W') , 

where 

IF/ = |(cc,y, z) I (x - a)^ -I- (y - &)^ < r^, x>i, < z < (i -I- 1)^| 
It is easy to derive 

vol{W[) = {{i + I)P - iP) ■ + ((t -h 1)^’ - iP) ■ 0(ri) . 

Now, we have, according to Lemma 1, that 



[a+rj-l [a-l-rJ-1 , 

^ vol{Wi)= {{i + l)P-iP)- (dmo,o{C 2 {i)) 

i—\a—r~\ i—\a—r~\ 




[a+rj— 1 [a+rj— 1 

+ ^ vol{W') = 

i=\a—r~\ i—\a—r~\ 



[a+rJ-1 [a+rJ-1 

= ^ ((z + I)^ - iP) ■ (dmo.o(C2(z))) - ^ ((Z + 1)" - ^P)) ■ - (Z - a)^ + 

i—\a—r] i=.\a—r] 
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[a+rj -1 

+ O (aP-^ ■ ^ vol{W'i) = 

i—\a—r'\ 



[a+rJ -1 

= ^ ((z + l)P-zP).(dmo,o(C2(i)))- 

i— [a— r] 



[a+rj-l 

i— [a— r] 



o (qP-^ 




So, the volume of Q' can be expressed as 

[o+rj -1 

vol{G') = ^ ((i + 1)^ - iP) ■ 

i—\a—r~\ 



^dmo,o(C'2(i)) - 2 • -{i- 



+ ([a — r])^ • 7T • — (a — r)P • tt • + 0{aP~^ ■ 

The next, vol{Q") = {a — r)P ■ tt ■ ^ gives: 
vol{Q) = vol{G') + vol{G") = 

[a+rJ -1 

= X! ((* + 1)^ “ ■ (^’^o,o(C'2(*) - 2 • ^/r2TT(7dra)2^ 

i—\a—r] 

+ i\a - y)P ■ TT ■ y + 0{aP-^ ■ = 

= dmo,o{G') + ([a - rl)P • 7T • r" + 0{aP-^ ■ rn+-)). 

Notice that 

[a+rJ -1 

{{z + l)P-zP)- 

i— \a—r~\ 

expresses the number of digital points in G' (with the error bounded by 
0{aP~^ ■ r)), excluding the points belonging to the plane 2 : = (|"a — r~\)P. 

Since the term 

([a — r])^ • 7T • + 0(a*’ • rn+®) , 

equals the number of digital points belonging to the body G” , adding the 

number of digital points belonging to the plane z = ( [a — r] (if a — r is not an 

integer), we have 

Wp,o(C) = vol{G) = dmo,o,o{G) + 0{aP ■ = dmp^o{C) + 0{aP ■ rn+®) , 

which completes the proof. QED 
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3 Case p ■ q > 0 

In this section asymptotical expressions for the differences mp^q{C) — dmp^q{C ) , 
where both p and q are strictly positive integers, will be given. 

Theorem 3. Let a, b and r be real numbers satisfying r < a and r < b. Then 
the following asymptotical expression holds: 

dmp,q{C) = i^ -f = f f xP ■y'^ -dx-dy + o(aP ■b‘1 ■ r . 

i,j are integers ^ '' 

(i-a)2 + (i-6)2<r2 O 

Proof. Let’s notice that dmp^q{C) is equal to the number of digital points be- 
longing to the 3D body 

H=^{x,y,z) I {x — a)"^ + {y — b)^ < r'^ , 0 < z < xP ■ y‘^'^ = H' LI H" , 

where 

H' = ^{x,y,z) I {x - a)'^ + {y - b)'^ < r"^ , Zmin < z < xP ■ y'^'^ 

and 

H"=^{x,y,z) I {x-af + {y-b)'^ <r^, 0 < z < Zmin'j ■ 

Zmin is the minimal 2 - value on the curve which is the intersection of the surfaces 
{x — a)^ + {y — b)'^ = and z = x^ • y'^ . In other words: 

Zmin = min{z I z = xP ■ y'^ and [x — a)^ + {y — b)^ = r^} 

Analogously, Zmax is defined to be the maximal z value on the same curve: 

Zmax = max{z I z = xP • y'^ and (x — a)'^ + {y — b)^ = r^} . 

First, consider the number of digital points belonging to the body H' . If Vi 
denotes the body 

Vi = ^{x,y, z)\ {x — a)^ + {y — b)^ < r“^ , xP ■ y'^ >i, i < z < min{a;^ • i -I- I}| , 

then for the volume of TC we have 
L^ma® J 1 

= ^ vol{Vi) + {\Zmin^ - Zmin) ■ ■ r + 0{r‘^) = 

i=\ZmiV 

b^axJ-1 

= ^ vom) + 0{r^) . 

r ^mfnl 



vol{Tt') 
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Since 

vol{Vi) = P{S2{{)) - vol{V'), 

where 82(1) and are defined as below 

52(1) = {(x,y) I (x-a)^ + (j/-&)^ <r^, ■ y'^ > i} 



and 

V^ = ^{x,y,z) I (a: — a)^ + (y — 5 )^ < r^, x>i, x^ ■ y'^ < z < i + 

Because the projections (on xy-plane) of V( do not overlap and belong to the 
the circle (x — a)^ + (j/ — &)^ < r^, ve have: 

J 1 

^ vol{Vl) = 0 {r^) . 

i = D,„inl 

Further, 



^ vol{Vi)= ^ (^dmo,o(-S'2(t)) + e> ^ vol{V') = 



= ^ dmo , o { S 2 { i )) + O (aP • . 

r ^ min ~\ 

{^max ^min — 0{aP ■ 6«) is used.) 

So, the volume of H' can be expressed as 

L-2^Tnaa; J 1 

vol{H') = Yl dmo,o{S 2 {i)) + 0 ( 0 ^ • • r*+")) . 

r ^ min ~\ 

The equality vol{H") = Zmin • tt • gives: 
vol(H) = vol{Ti.') + vol{T-L") = 

L-^maa; J 1 

= 'Y dmo,o{S2{i) + 0{aP ■ + Zmin • tt • = 

r ^ miri \ 

L-^max J 1 

= ^ dmo.o(^2(z)) + 0(a^’ • • r*+-) . 

2=1 



Notice that 



L^TTidxJ 1 

Y dmofi{S2{i)) 

2=1 
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expresses the number of digital points in Q excluding the (possible) points be- 
longing to the plane z = [ Zmax^ ■ So, the set of the following equalities 

mp^q{C) = vol{Q) = dmo^ofi{G)+0 ■ b‘^ ■ rn+=^ = dmp^q{C)+0{aF -Ifl 

is proved. QED 



4 An application and conclusion 



We illustrate a possible application of the obtained results by the solution of a 
problem which is (in details) considered in [5] . The problem is: 

How efficiently an original circle can he reconstructed from the corresponded 

digital data ? 

The answer is that the the center position and the diameter of the original 
circle C can be reconstructed with an error upper bounded with O 

hestf — 



if the radius is estimated as rest = \J , while 

( dmo’o(C) ’ dmo’o(C) ) estimates the center position. Numerical data strongly 
confirm the obtained result. A few examples are given in the next table ([5]). 



Table 1. Absolute errors between the original values of the radius and coordinates of 
the center (r, a and h, respectively) and their estimations (oest, best and Cest) from the 
digitization of the circle C : (x — a)^ + {y — b)^ < r^. The discrete moments: dmo,o(C), 
dm\fi{C) and dmifi{C) are used. 



r 


a 


b 


rest - r 


Otest 


best b 


2.5 


9.4 


6.2 


-0.04075 


-0.03157 


0.01052 


282.8 


33.3 


0.02313 


-0.05001 


-0.04999 


2444422.4 


33222.2 


-0.04075 


-0.03157 
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Extensions of the result to other shapes (different from circles) are also pos- 
sible. Let us mention here that the moments have been widely used in shape 
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recognition and identification. That is a reason for a large number of papers 
related to the (discrete) moments calculation from a picture (for example [3]). 
This paper gives an apriory knowledge about the precision in the estimation of 
(real) moments from the corresponding (digital) picture. On the other side, the 
digital circles and the digital circular arcs are on of the basic objects considered 
in image processing and shape analys. For more results we refer to [4]. 

The results of this paper are based on the Huxley’s result. This is a strong 
result which is related to the number of integer points inside a smooth enough 
convex curve 7 . This is an old mathematical problem. Gauss and Dirichlet knew 
that the area of the shape bounded with 7 estimate this number within order 
0{s), where s is the length of 7 . The situation when 7 is a circle is mostly studied. 
The Huxley’s result (1) improves the previously best known upper bound ([2]) 
even in this case. ^From the given proofs of Theorem 2 and Theorem 3 it can be 
concluded that the derived result is sharp up to Huxley’s result. 

References 

1 . Huxley, M. N.: Exponential Sums and Lattice Points. Proc. London Math. Soc. 3 
(1990) 471-502. 

2. Iwaniec, H., Mozzochi, C. J.: On the Divisor and Circles Problems. J. Number 
Theory 29 (1988) 60-93. 

3. Jiang, X., Bunke, H.: Simple and Fast Computation of Moments. Pattern Recogni- 
tion 24 (1991) 801-806. 

4. Worring, M., Smeulders, A.W.M.: Digitized Circular Arcs: Characterization and 
Parameter Esstimation. IEEE Trans. PAMI 17 (1995) 587-598. 

5. Zunic, J., N. Sladoje, N.: A Characterization of Digital Disks by Discrete Moments. 
Lecture Notes in Computer Science: Computer Analysis of Images and Patterns 
1296 (1997) 582-589. 




Graceful Planes and Thin Tunnel-Free Meshes 



Valentin E. Brimkov and Reneta P. Barneva 



Eastern Mediterranean University, 
Famagusta, TRNC, Via Mersin 10, Turkey 
{brimkov , barneva} . asSmozart . emu . edu . tr 



Abstract. In this paper we present an approach to describe polyhedra 
by meshes of discrete triangles. The study is based on the theory of 
arithmetic geometry [10]. We introduce a new class of discrete planes 
(respectively lines) which we call graceful planes (respectively graceful 
lines). We nse naive planes and gracefnl lines to obtain as thin as possible 
triangnlar mesh discretization admitting an analytical description. The 
interiors of the triangles are portions of naive planes, while the sides are 
gracefnl lines. 

Key words: Discrete 3D modeling, Discrete planes, Discrete lines, Dis- 
crete triangles, Mesh of triangles. 



1 Introduction 

A promising approach in discrete geometry for computer imagery is the one 
based on arithmetic geometry^ . Reveilles observed in his pioneer work [10] that 
discrete straight lines in the plane can be analytically defined through a dou- 
ble linear Diophantine inequality of the form 0 < ax + by + < u, where the 

parameters are all integers. The parameter w > 0 is interpreted as a thickness 
of the discrete line, while fi is the internal translation constant which measures 
the shift of the line with respect to the origin. Subsequently this definition has 
been naturally extended to define discrete planes 0 < ax + by + cz + fj, < u), 
where the parameters /i and tv have the same interpretation. In recent years, 
arithmetic geometry has been developed by many authors (see for example the 
bibliography in [9]). The main objective is to obtain simple analytical descrip- 
tion of the basic Euclidean primitives (lines, segments, triangles, circles, planes, 
spheres, etc.) and, on this basis, to create tools for efficient modeling of more 
sophisticated objects composed by such primitives. (See [2,1] for related discus- 
sion.) The approach of arithmetic geometry is very successful when dealing with 
linear objects, such as lines, planes, or their portions. This is of importance from 
a practical point of view since for various applications, it is sufficient to obtain 
good polyhedral approximation to a given real object. Thus it is an important 
task to study the properties of the above mentioned linear primitives, and to 
develop new methods for modeling through such primitives. 

^ Sometimes the terms “analytic discrete geometry” or “digital geometry” are used 
instead. 
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In 3D modeling the discrete planes play a central role. The discretized objects 
are usually represented as meshes of polygonal (usually triangular) patches of 
discrete planes. A principal requirement for the obtained discretization is to be 
tunnel-free, i.e., to have no hole of certain type. The best studied classes of 
discrete planes are the naive planes (with thickness to = max{|a|, |6|, |c|}), and 
the standard planes (with thickness uj = |a| + |6| + |c|). These two classes possess 
interesting and useful properties. For instance, the naive planes are the thinnest 
possible tunnel-free discrete planes [1]. However, naive planes have almost never 
been used for modeling purposes, due to certain theoretical obstacles which are 
difficult to overcome. For example, the intersection of two naive planes may be 
a set which is too far from the intuitive idea of line and, in fact, it may be a 
disconnected set of voxels (see corresponding examples in [7,5]). 

In a recent work [5] we have studied the possibilities of using naive planes 
for triangular mesh discretization. We have proposed a general discretization 
scheme and, within its framework, we have shown how to obtain a thin tunnel- 
free approximation to a mesh of two triangles with a common side. This solution 
might be quite satisfactory regarding any practical purposes. However, in one 
particular degenerate case the solution is necessarily algorithmic rather than an- 
alytical. In the present paper we continue and extend our investigation with the 
aim to obtain a way for truly analytical discrete modeling, through the thinnest 
possible discrete planes. 

We approach the problems of generating 3D polygons (triangles, for definite- 
ness), segments of 3D discrete lines, as well as meshes of 3D discrete triangles. 
For this purpose, we define a new class of discrete planes which we call grace- 
ful planes (Section 3). The latter can be classified between the naive and the 
standard planes, and have the important property of being the thinnest possible 
discrete planes such that any (appropriately defined) “linear segment” of any 
graceful plane is a connected set of voxels. On the basis of graceful planes we de- 
fine 3D discrete lines, called graceful lines, which are portions of graceful planes. 
One can use graceful lines to approximate the sides of 3D triangle, while the 
rest (interior) of the triangle can be modeled through naive planes (Section 4). 
Thus we obtain a thin tunnel-free triangular mesh discretization with analytical 
description (Section 5). We conclude with some remarks in Section 6. 



2 Preliminaries 

In this section we recall some definitions and facts from discrete geometry, which 
will be used in the sequel. For a detailed account of the matter, the reader is 
referred to [10,5]. 



2.1 Definitions and Facts 

Throughout \x\ denotes the greatest integer not exceeding the real number x, 
while {^} = p — <?LgJ Euclidean remainder of the rational number 
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Discrete coordinate plane/ space consists of unit squares/cubes, called pix- 
els/voxels, centered on the integer points of the Cartesian coordinate system 
in the plane/space, and with edges parallel to the coordinate axes. In the dis- 
crete coordinate plane two pixels are /-adjacent if they share a common side, 
and they are 8-adjacent if they have a common side or vertex. A discrete line 
L{a,b, p,,uj) Q < ax by pi < oj \s naive if w = max{|a|, |6|}, standard if 
w = |a| -I- \b\, and *-connected if max{|a|,|6|} < lo < |a| -I- |6| [10]. If a and 
b are both nonzero, then any two neighboring pixels of a standard line are 4- 
connected, while a ^-connected line has always two pixels which are 8-adjacent 
but not 4-adjacent. 

Any 2D naive line L is functional on one of the coordinate axes, i.e., for 
every fixed x (every fixed t/) G Z there exists exactly one pixel belonging to 
L. A special case of naive line providing the best discrete approximation to 
the corresponding continuous straight line among all the possible naive lines 
is the Bresenham line [4]. This line admits an analytical description. Given an 
Euclidean line ax -\- by -\- c = the relevant Bresenham line can be defined as a 
naive line L(a, b,c-\- [^J , w), where to = max{|a|, |6|}. 

Two discrete lines are called equivalent if they are equal within translation. 

In the 3D space, two voxels are 6-adjacent if they share a common face. The 
voxels are 1 8-adjacent if they have a common edge or face. They are 26-adjacent 
if they have a common vertex, edge, or face. A sequence of voxels is a k-path 
{k = 6, 18, 26) if every two consecutive voxels along the sequence are fc-adjacent. 
Two voxels are k-connected {k = 6, 18, 26) if there exists a fc-path between them. 
A set of voxels is connected if there exists at least a 26-path between every two 
voxels. Otherwise it is disconnected. 

The plane P{a, b, c, pt,uj) : 0 < ax-\-by-\-cz-\-p, < u> has k-tunnel {k = 6, 18, 26) 
if there exist two /c-adjacent voxels A{xA,yA: za) and B^xb^Pb, Zb) such that 
axA+byA+czA+y < 0 and axB+byB+cZB+y > to. A discrete plane without any 
/c-tunnels is said to be k-tunnel- free. For our purposes, we also need a definition 
of 6-tunnels and 6-tunnel freedom in finite sets of voxels. Such a definition can 
be given in terms of algebraic topology. Specifically, one can consider a voxel as 
a closed unit cube centered at a point in the 3D space with integer coordinates. 
Given a finite set of voxels S, let US' denote the polyhedron obtained as a union 
of the voxels in S. We say that S has 6-tunnels if US is not a simply connected 
set. Otherwise S is 6-tunnel free [5]. Usually a 6-tunnel-free set is called tunnel- 
free for short. In a similar (although somewhat more involved) way one can also 
define 18- and 26-tunnels. 

A discrete plane P = P{a, b, c, pi, to) is functional on a coordinate plane, say, 
Oxy, if for any pixel (x,y) from Oxy there is exactly one voxel belonging to P. 
The plane Oxy is called functional plane for P(a, b, c, /i, to) and denoted by np. 
It is a well known fact that a naive plane is functional on at least one of the 
coordinate planes Oxy, Oyz, or Ozx [7]. Moreover, if P{a,b,c, pi,to) is a naive 
plane with jcj = max(|a|, |6|, jcj) then Tip = Oxy. 
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Finally, we recall that a naive 3D digital line [8] is defined as intersection of 
two discrete planes which is 26-connected set of voxels and such that removing 
any voxel separates the line into two 26-connected components. 



2.2 “Naive” Triangles 

We first define a triangle in the 2D plane. Given an Euclidean triangle AABC 
in Oxy, the corresponding 2D discrete triangle T = T 2 d{A, B, C) is determined 
as follows. T consists of a border which is the set of pixels from the Bresenham 
line segments between the vertices A, B, and C of T 2 d{A, B, C), and an interior 
which is the set of all pixels whose centers belong to the interior of AABC but 
do not belong to the border. 

Three-dimensional (“naive”) discrete triangle T{A, B,C) is defined as a por- 
tion of a special kind of naive plane, called regular plane and denoted , 

passing through the centers of three given voxels A, B, and C - vertices of 
the triangle. The projection of T{A, B,C) over iTpABc is a 2D discrete trian- 
gle T 2 d{A' , B' ,C), where A', B' , and C are the respective projections of A, B, 
and C onto TVpABc. The regular plane is centered about the corresponding 

continuous plane through A, B, and C.lfV: ax by -\-cz-\-d=0,0<a<b<c 
is a continuous plane in the 3D space, then the corresponding regular plane is 
the naive plane Pp = P{a,b,c,d-\- [^\,uj). 

A side of T{A,B,C), say, AB, is the set of voxels whose projections on 
IT p ABC constitute the segment of Bresenham line defined by A' and B'. AB can 
be considered as a sort of discrete 3D line, which we call pseudoline [5]. 



2.3 Generation Scheme 

The 3D discrete triangles defined above can be obtained through a simple gener- 
ation scheme. Let A, B, and C be a triple of points in the 3D space with integer 
coordinates, and let us consider them as vertices of a 3D discrete triangle. Our 
generating method adheres to the following scheme. 

Generation Scheme 

1. Determine the regular plane Pp^^ from which the discrete triangle is a 
portion. 

2. Find its functional plane iTpABc. 

3. Find the projections A', B' , and C of the vertices A, B, and C onto 

TT pABC . 

4. Compute the 2D triangle with vertices A', B', and C in TTpABc. 

5. Generate the 3D triangle in P from its projection over npABc. 

Clearly, this scheme is readily adapted for generation of other 3D Euclidean 
primitives, such as lines, segments, arbitrary polygons, etc. It can also be easily 
modified for planes which are thicker than naive and thus not functional on any 
coordinate plane. Such a modification has been suggested in [5]. Specifically, a 
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discrete plane P with thickness u) greater than c is considered as pseudofunctional 
on the coordinate plane Oxy corresponding to the largest coefficient c. Then 
the pseudofunctional plane can be used in the generation scheme instead of 
functional plane. 

Let us mention that a similar way of generating the supercover of an Eu- 
clidean triangle (that is the set of all voxels intersected by the triangle) has been 
presented in [3]. 



2.4 “Naive” Triangular Meshes 

Let us consider four voxels A, B, C, and D, such that A, B, and C form one 
triangular patch, and A, B, and D form another triangular patch. The pair of 
the edge-connected triangles A(A, B, C) and A(A, B, D) is called a mesh of two 
triangles. 

Let P^^'^ and P^^^ be the regular planes determined by the triples A, B, 
C and A, B, D, respectively. Using the generation scheme from the preceding 
section, one can determine two discrete triangles T{A, B,C) and T{A, B, D), 
belonging to Pfi^'^ and P^^^ , respectively, and forming a mesh of discrete 
(“naive”) triangles. 

In has been noticed in [5] that the borders of a discrete triangle, as well as 
the discrete triangle itself, might be a disconnected set of voxels. If P^^'^ and 
Pf)^^ are functional on the same coordinate plane, and both T{A,B,C) and 
T{A, B, D) are disconnected, the corresponding mesh may be disconnected, too. 
A disconnected discrete triangle (disconnected mesh of discrete triangles) admits 
a simple topological characterization provided by the following lemma. 

Lemma 1. [5] Let the discrete triangle T = T{A, B, C) be a portion of a discrete 
plane which is functional on Oxy. Then T is disconnected if and only if there is 
at least one pair of voxels vi, V 2 G T such that the following conditions are met: 

(a) There exists a vertex ofT, say A, such that the projections of v\ and V 2 
onto Oxy belong to the Bresenham lines A'B' and A'C' in Oxy, determined by 
the corresponding projections A' , B' , and C of A, B, and C. 

(b) Ifvi = {x',y',z') and V 2 = {x” ,y” , z”) , then \x' — x”\ = 1, \y' — y”\ = 1, 
and \z' — z!'\ = 2; 

(c) Without loss of generality, suppose thatvi = {x' ,y' ,z'), V 2 = {x' +l,y' + 
1, A-l-2). ThenT 2 D = T 2 d{A' , B' ,C) does not contain the pixels p' = (x',y' + l) 
and p" = (x' -I- 1, y'). 

A pair of voxels V\ , V 2 satisfying the conditions of Lemma 1 is called point of 
disconnection of the discrete triangle T. A pair of voxels v\, V 2 which belong to a 
pseudoline g and satisfy condition (b) of Lemma 1, is called point of disconnection 
of g. Clearly, a point of disconnection of T is a point of disconnection of two of 
the sides of T (which are pseudolines, by definition). 
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3 Graceful Planes and Lines 

In this section we introduce an important class of planes which will play a 
crucial role in our modeling methods. Throughout, for a given discrete plane 
P{a, b, c, n, Lo), we will assume for the sake of simplicity that 0 < a < b < c. We 
can also assume without loss of generality that a, b, and c are relatively prime. 
We begin with a few remarks. 

Assume first that P has a thickness oj < c, i.e., P is “thinner” than a naive 
plane. Then P may contain tunnels and, in general, may be a disconnected set of 
voxels. Therefore in constructing methods for discretization we will be interested 
in classes of planes of thickness co > c. 

Let P be of thickness to = c, i.e., a naive plane. Then P is functional on 
Oxy. It is well known (see, e.g., [6,5]) that P may contain a configuration of the 
form shown on Fig. 1. In this configuration there are two voxels v and v', which 
correspond to pixels p and p' having a common vertex, but the z-coordinates of 
V and v' differ by 2. We will call such a pair of voxels in a discrete plane a jump. 




Fig. 1 •Possible configuration of voxels in a naive plane. 



Now let P be of thickness cu > c, i.e., “thicker” than a naive plane. It means 
that P will not be functional on Oxy, i.e., for certain pixels (x, y) of Oxy, P will 
contain at least two voxels one on the top of the other. If for some pixel {x, y) 
there are exactly two voxels v = {x, y, z) and v' = {x, y, z + 1) belonging to P, 
we will call the pair v,v' a tandem. 

The following fundamental lemma will be our basic tool in developing dis- 
cretization methods. 

Lemma 2. A naive plane P{a, b, c, p, oj) has jumps if and only if c < a + b. 

Proof. To prove the stated result, we will show that: If c < a+b than the discrete 
plane P has jumps; If c > a -I- & then P does not have any jumps. 

Given a naive plane P{a, b,c, p,u>) : 0 < ax + by + cz + p < to, consider a 
fixed value of z: z = Zq. For this value of z, the above double inequality becomes 
0 < ax+by+czo+p < lu, which is an equation of a discrete line Lz^ with thickness 
Lo and translation constant po = p + czq. It is clear that the naive plane P has 
jumps if and only if for some zq the line Lz^ is thinner than standard (see Fig. 
2a). 
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Fig. 2. Illustration to the notions jump, graceful plane, and graceful line. In figures 
a) and b) every pixel is marked by the corresponding 2 -coordinate, as the lower-left 
corner corresponds to the voxel (0, 0, 0). a) Naive plane P(4, 5, —8, 4, 8). For 2 = 4 the 
corresponding discrete line in Oxy contains two 8-adjacent pixels, and at that point 
the plane has a jump between levels 3 and 5. b) Graceful plane P(4, 5, —8, 4, 9). The 
pixels corresponding to tandems of voxels are marked by asterisk. Denotation fc* means 
that there are two voxels corresponding to that pixel, with 2 -coordinates k and fc -I- 1, 
respectively. Apparently, these voxels form a lattice, c) Graceful line. It is a part of the 
given graceful plane P(4, 5, — 8, 4, 9), and its projection is a naive line marked in Fig. 
2b). Note that the pseudoline which could be built in the corresponding naive plane 
P(4, 5, —8, 4, 8) would be disconnected, as the voxel in dark gray would be missing. 



Consider first the trivial case gcd{a, b) = 1. Under this condition, all the 
lines of the form 0 < ax + by + czq + y < to are equivalent^. Therefore it 
suffices to consider an arbitrary such line. If to = c<a + b then the line 0 < 
ax + by + czq + ij, < to is ^-connected and hence it contains pixels pi and p2 which 
are 8-adjacent but are not 4-adjacent. Let p[ and p'2 be the pixels for which both 
Pi and p2 are neighbors. Then the voxels from P corresponding to Pi and p'2 
expose a jump. 

If to = c = a + b then the line 0 < ax + by + czq + po < to is a, standard line; 
if to = c>a + b then it is a line thicker than standard. Clearly, in both cases 
the plane P does not have any jumps. 

Consider now the case when gcd(a,b) = d yf 1. Let us fix a and b. Consider 
a naive plane P{a,b,c, p,to) with w = c < a + b, i.e., c = a + b — k, where 

^ Theorem 3 [7] says that the intersection of a discrete plane P{a,b,c, p,to) with the 
plane 2 = 2 q is a set of discrete lines; furthermore, for all integer values 2 q these 
discrete lines are equivalent. We notice that the second assertion of the theorem 
holds only if gcd{a, b) — 1, while for arbitrary a, b, and c with gcd{a, b,c) = 1 such a 
statement is not valid. 
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1 < k < a. Without loss of generality we consider only the extreme case k = 1, 
i.e., c=a + 6 — 1. In fact, all the planes P{a, b,a + b — k, 1 < k < a, can 
be considered as obtained from the plane P{a, b,a + b — 1, ^,u>) through cutting 
certain voxels. Obviously, this cannot lead to any loss of jumps. 

The double inequality 0 < ax+by+cz+n < u) converts into 0 < ax+by+{a+ 
b—l)z + y, < a+b—1. For any fixed z this is a discrete line Lz{a,b, y! ,tS) with a 
translation constant y' = {a + b—1) + y. We know from [3] that this discrete line 
is equivalent to a discrete line L^{a' ,b' , y ” where a' = |, 6 ' = y” = 
uj' = +1 if {^} > {^}, and w' = else. More in detail, is given by one 

of the following two double inequalities: 0 <|x+^y + 
or 0 < + 1. 

In the latter case we have a standard line since its thickness is equal to 

Regarding the former case, we will show that one can always find an integer 
z for which 

and thus the first one of the above two cases holds. We have that { } = d—1. 

Since any rational number modulo d cannot exceed d—1, it follows that the only 
possibility is to have equality in (1). 

Let us set m = a + b — 1. We look for z for which mz + y = m(mod d), i.e., 
mz = m — y{mod d). It is a well known fact from number theory that the above 
equation has a solution if and only if gcd{m, d) divides m — y. But it is so since 
d = gcd{a, b), hence gcd{m, d) divides a and b. Then, in order to divide also m = 
a + b—1, gcd{m, d) must be equal to 1. Thus we obtained that the equation (1) 
has always an integral solution zq. Since is integer, we have for the thickness 
of the corresponding discrete line — 1 < 3 + 3 

= a' + b' , i.e., for z = zq the discrete line is ^-connected and thus the discrete 
plane P has jumps. 

If c > a + b, i.e., c = a + 6 + A: for some integer k > 0, then clearly the 
corresponding discrete line 0 < ax + by + {a + b + k)z + y < a + b+ k ior fixed 
z will always be standard or thicker than standard, and therefore the plane P 
will have no jumps. □ 



Remark 3. Clearly if w < c then the plane may have either tunnels or both 
jumps and tunnels. It is also easy to see that if c<w<a+ & then the plane has 
both jumps and tandems. 

In the framework of our generation scheme, we are concerned with the study 
of possible disconnections of lines, triangles, and meshes. While Lemma 1 pro- 
vides a structural description of the possible disconnections, the following lem- 
mas provide a corresponding analytical characterization. 

Lemma 4. Let P{a, b, c, y, to) be a discrete plane and let AB be a discrete seg- 
ment in P, obtained through the generation scheme applied to a naive discrete 
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line A'B', where A' and B' are the projections onto Tip of A and B, respectively. 
Then AB can be disconnected if and only if to < max{a -I- &, c}. 

Proof. If w < max{a -I- b, c} then according to Lemma 2 P has jumps. Given a 
jump V, v' in P, let p,p' be the corresponding pixels in np. If now we choose any 
naive line in np including the pixels p and p' , then obviously the corresponding 
discrete line segment in P will be disconnected. 

The “only if’ part follows from the observation that a pseudoline in P may 
be disconnected only if P has jumps. □ 

Lemma 5. Let a discrete plane P{a,b,c, p,,to) be given. A 3D discrete triangle 
in P can be disconnected if and only if to < max{a -I- 6, c}. 

The proof is analogous to that one of Lemma 4. It is not difficult to see 
that Lemma 5 is still valid for polygons with arbitrary many vertices. The above 
discussion and the obtained properties lead us to introducing the following def- 
inition. 

Definition 6. A discrete plane P{a, b, c, p,, to) with thickness to = max{a -I- 6, c} 
is called a graceful plane. 

The class of the graceful planes lies between the class of the naive and the 
standard planes. Note that if the maximum in the above definition is reached for 
to = c then the graceful plane is a naive plane. Otherwise the graceful plane is a 
naive plane which in addition contains tandems. More precisely, for each value of 
to between c-|-l and a+b a lattice of voxels is added. These lattices are equivalent 
and they belong to different parallel Euclidean planes, in correspondence to the 
concrete value of w. As a result, tandems appear in such a way that the obtained 
graceful plane is jump-free, and it is the thinnest possible discrete plane with 
this property. (See Fig. 2b). 

Lemmas 4 and 5 show that the graceful planes are the thinnest possible 
discrete planes for which, in the framework of the generation scheme. Euclidean 
primitives like lines, segments and triangles (as well as arbitrary polygons) are 
always connected sets of voxels. This is an important property to be used further 
for modeling purposes. 

Using the generation scheme, one can define a discrete line in a graceful plane. 
We will call such a line graceful line. (See Fig. 2c). 

It is not difficult to see that any graceful line is either a naive 3D digital 
line or, otherwise, it may contain tandems. In contrast, the pseudolines defined 
earlier, which are obtained in the same way but on the basis of naive planes, may 
be connected or disconnected sets of voxels. In the former case they are naive 
3D digital lines. In the latter case they can be considered as naive 3D digital 
lines with points of disconnection. 

4 Modeling of 3D Triangles 

As mentioned, the graceful planes can be used for modeling of Euclidean prim- 
itives. Based on the generation scheme, one can obtain connected and, thus. 
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tunnel-free discretizations of segments and triangles. Regarding triangles or other 
polygons, the method can be refined in the following way. 

Graceful sides Naive interior (GN-method) 

1 . The sides of the triangle are approximated through the corresponding grace- 
ful lines. 

2 . The interior of the triangle is approximated through the corresponding por- 
tion of the relevant regular plane. 

According to [ 5 ], a naive discrete triangle has tunnels only if it is discon- 
nected. Keeping this in mind and using Lemma 5 , it is easy to see that the 
discrete triangle obtained through the GN-method is connected and tunnel-free. 
Another way for obtaining thin triangular approximation is the following. 

Standard-Naive sides Naive interior (SNN-method) 

1 . The projection of the sides onto the functional plane are standard lines, while 
the sides themselves are portions of naive planes. 

2 . The interior of the triangle is approximated through the corresponding por- 
tion of the relevant regular plane. 

In view of Lemma 1 , one can easily deduce that the obtained discrete polygon 
is connected and tunnel-free. 



5 Triangular Meshes Modeling 

We start this section with a fact similar to Lemmas 4 and 5 . 

Proposition 7. Let a mesh of two discrete triangles T\ and T2 be given, such 
that they are parts of discrete planes Pi{ai, 61, Ci, /Zi, Wi) and ^2(^2: C2, /T2, W2), 

respectively, with uji > ci and u>2 > C2. Then the mesh can be disconnected if 
and only if u>i < maxjai -I- bi, ci} and 0J2 < max{o2 -I- 62, C2}. 

The proof is similar to those of Lemmas 4 and 5 . This proposition shows that 
the generation scheme applied directly to discrete planes thinner than graceful 
does not always produce a tunnel- free discretization. Below we present two ap- 
proaches for obtaining thin discrete tunnel- free triangular meshes. The methods 
rely on appropriate modifications of the generation scheme and use the methods 
for triangle modeling given in Section 4 . The following proposition provides an 
approach for tunnel-free triangular mesh modeling. 

Proposition 8. Let a mesh of two discrete triangles T\ and T2 be constructed 
according to the generation scheme in such a way that they are portions of grace- 
ful planes. Then the mesh is connected and tunnel-free. 
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The proof is analogous to one of Theorem 9. 

The method based on Proposition 8 can be refined on the basis of the GN- 
method for triangle discretization. Accordingly, every triangle is approximated 
through the GN-method. We have the following theorem (which, in particular, 
implies Proposition 8). 

Theorem 9. Discrete triangular mesh obtained applying the GN-method to all 
triangular patches, is tunnel-free. 

Proof. Gonsider a mesh of two discrete triangles Ti = Ti{A, B,C) and T 2 = 
T 2 {A, B,C) obtained through the GN-method, in the framework of the gener- 
ation scheme. Assume first that and P^^^ are functional on the same 

coordinate plane. In view of Lemma 5, each of the discrete triangles Ti and T 2 
is connected and tunnel-free. Moreover, both triangles have the same common 
side and therefore the mesh Ti U T 2 is tunnel-free, too. 

Gonsider now the case where P^^^ and P^^^ are functional on different 
coordinate planes. Similar to the previous case, both discrete triangles are con- 
nected and tunnel-free. It has been shown in [5] (Theorem 5) that, under the 
latter condition, the mesh of the corresponding naive discrete triangles is con- 
nected and tunnel-free. Hence the mesh T\ U T 2 of triangles with naive interiors 
and graceful sides will be tunnel-free, as well. □ 

Another approach of obtaining thin triangular mesh approximation can be 
based on the SNN-method. Accordingly, every triangular patch is approximated 
through the SNN-method. We have the following theorem. 

Theorem 10. Discrete triangular mesh obtained applying the SNN-method to 
all triangular patches, is tunnel-free. 

Proof Let Ti = Ti(A, B, C) C P^^^ and T 2 = T 2 {A, B, D) C P^^^ be discrete 
triangles obtained through the SNN-method. In the case when P^^^ and P^^^ 
are functional on different coordinate planes the proof is identical to that one of 
Theorem 9. 

Gonsider the case when P^^'^ and P^^^ are functional on the same co- 
ordinate plane. Since the projections of the triangles’ sides onto the common 
coordinate plane are standard lines, then it is easy to see that each of the dis- 
crete triangles Ti and T 2 fails the condition (c) of Lemma 1. Thus Ti and T 2 are 
connected and tunnel-free, and they have a common side AB. Gonsequently, the 
mesh Ti U T 2 will be tunnel-free. □ 

It is clear that a discrete side can be analytically described through two 
double linear inequalities of the form 0 < ax -\- by -\- cz -\- p, < uj and 0 < 
a'x -\- b'y -\- p' <u}' . 

It is also easy to see that the presented generation methods can be organized 
in such a way that their time complexity will be linear with respect to the 
number of voxels generated, which can be represented in a 2D array. (See [3,5] 
for a detailed discussion on a similar matter.) 
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6 Concluding Remarks 

In this paper we have defined a new class of discrete planes “sandwiched” be- 
tween the naive and standard planes. On the basis of these planes, one can 
construct thin discrete triangular tunnel-free meshes admitting analytical de- 
scription. 

It would be interesting to look for other reasonable discretization methods 
providing optimally thin tunnel-free approximation to a polyhedral surface. It 
would be also a challenging task to develop relevant methods for obtaining thin 
discretizations of various 2D curves (e.g., the conic sections) and 3D surfaces 
(e.g., the quadratic surfaces). 
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Abstract. The aim of this article is to provide some arithmetical tools 
in order to study the local properties of digital hyperplanes. 

With the help of the new general notion of configuration, we investi- 
gate the arrangement of the different combinatorial structures contained 
in a digital hyperplane. The regularity of this deployment is controlled 
by two arithmetical functions that we call code^j) and boundary(^iy By 
using these two simple tools, we prove that the local configurations in 
a functional digital hyperplane only depends on its normal vector and 
that their number is less than the size of the chosen neighborhood. 
Keywords: windows, local conhgurations, digital hyperplanes. 



1 Introduction 

The fact that such classical mathematical tools as differentiability are totally 
ineffective in digital geometry has aroused the curiosity of more than one author 
especially in DGCI conferences. Some of the mathematical tools designed to 
study the digital functions mapping 1? to Z include the notions of tricubes 
introduced by Isabelle Debled-Renesson [GV97,D95,RY97,S97], of n-m local 
configurations [R91,V99] or under another name the n-m- cubes. They are both 
built on the same principle: one looks at the functions through a window that we 
translate and which is rectangular and 3 x 3 in size for the tricubes or m x n in size 
for the m-n local configurations. But why should we only consider rectangular 
windows? We could as well look at a function through a triangular window or 
even through any window and that is the idea we are going to develop here. 
In this paper, we take any window and in order to justify this choice, we will 
generalize the theorem of J-P Reveilles where he proves that a naive plane 
contains at most nm different n-m cubes [R95,V99]. In fact, this result is a 
direct consequence of a general analysis inspired by the integer part function. 
The keypoint is that we reduce these multidimensional combinatorial problems 
to unidimensional questions by using a fundamental lemma called trampoline 
and two functions called codc(^j) and boundary (^jy Finally, we will illustrate the 
use of these tools by sketching a proof of the well-known fact that there exist 
exactly 40 tricubes. 



G. Bertrand, M. Couprie, L. Perroton (Eds.): DGCI’99, LNCS 1568, pp. 65—75, 1999. 
(c) Springer- Verlag Berlin Heidelberg 1999 
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2 Local configurations 

The purpose of this section is to introduce the general notion of a local config- 
uration of a map from a group into a group. At first, we introduce a definition 
for the windows of a group. 

Definition 2.1 .—Let G be a group, a finite subset of G eontaining the neutral 
element is ealled a window o/G. The eardinality of the subset is ealled the size 
of the window. 

The subset {0 ; 3.65 ; 4.5 ; — 6} is an example of window of the additive group 
K of size 4. In this paper, the windows are denoted W , their size s and we denote 
their elements w^, w‘^,...,w'^,...,w^ so that the neutral element of the window is 
always the first on the list = Og, neutral element of G). The indices of 
the elements of the window are written as exponents so that we will be able to 
denote ru* the coordinates of ic* in the case where the group G is K" or Z". 
Now, let G and G' be two groups, W = ■ ■ ■ , w®} a window of G 

of size s and / a function mapping G to G' . We define the local configurations 
through W of the function /. 

Definition 2.2 .— The local configuration through W of the function i at a 
point X belonging to G is the element o/(G')®.’ 

(f(x + wl) -f(x),f(x + w2) -f(x),---,f(x+W®) -f(x)) . 

As = 0, the first value f{x + w^) — f{x) of the s-uple is always the neutral 
of G'. 

If the function maps Z"~^ to Z, this notion can be intended for use in computing 
something like the normal direction of digital objects (a classical problem in 
imagery). For the investigation of such maps, we will be inspired by the local 
configurations through a window W of a classical map: the integer part function. 

3 The integer part function 

We recall that the integer part function [ ] : M ^ Z is defined by [x] = max{y G 
^ I y < a;}- We denote also {x} the fractional part of x such that x =[x\ + {x} 
or again with a different notation {x} = a; mod 1. The integer part function is a 
map sending the additive group IR to the additive group Z. We study it because 
it performs a transition between continuous and digital worlds. We are going 
to investigate its local configurations through a window W by using a simple 
lemma. 

3.1 Jump- lemma 

Let X and w be two real numbers. 

Lemma 3.1 .— (a preliminary version of jump-lemma)— We have 
[x -I- w] — [x] = [w] if {x} < 1 — {w} 

= [w] -I- 1 otherwise. 
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The proof uses nothing else than inequalities. The consequence of this lemma is 
that for a given real w, the difference [x + ru] — [a;] can only take two different 
values: [ru] or [w+ 1]. The lemma specifies when the difference takes one or other 
possible value : it depends on the comparison of {x} with 1 — {tc}. To represent 
these two “separated” values, we will introduce some new notations: 

Definition 3.1 We define the maps code : R ^ [0, 1[ by code(x) = {x} and 
boundary : R — >]0, 1] by boundary(w) = 1 — {w}. 

These two functions are closely related to the functions boundary(^j) and codcf^i) 
that we will use later in order to investigate the local configurations of digital 
hyperplanes. With these notations, jump-lemma can be written out: 

Lemma 3.2 (Jump-lemma)— We have 
[x -I- w] — [x] = [w] if code(x) < boundary(w) 

= [w] -I- 1 otherwise. 

For a given real w and any real x, the value taken by [x -\-w] — [x] only depends 
on the comparison of the value code{x) with the value boundary{w) and it is the 
keypoint in our investigations into the local configurations of the integer part 
function. 



3.2 The local configurations of the integer part function 

Let (c0i<*<« be the local configuration of the integer part function at a point x 
through a given window W = • • • , of R. By definition, the value c* 

is [x-kw*] — [x] and we could compute it directly. According to the jump-lemma, 
c* is equal to [w®] if code{x) < boundary{w'‘) and to [w*] -I- 1 otherwise. It shows 
that the local configuration at point x only depends on the values [rc*] and on 
the comparison of code{x) with the s values boundary(w^). These s numbers 
boundary {w’') can be sorted and so ordered, they partition the intervall [0, 1[ 
into s regions which are closed on the left and opened on the right side (when 
two boundaries are equal, the region that they surround is empty). To sum up, 
we can say that for a given window W, the local configuration at a point x 
through the window W only depends on the region where there is code{x). Then 
each region corresponds to a local configuration and the local configuration at x 
is the local configuration associated with the region of code{x). We will illustrate 
this argument by an example. 

Example.— Let us describe the local configurations of the function integer part 
through the window of reals W = = 0; = 5,6; = 2,45; = 3,75; 

= —1,8}. By definition, the local configuration of the integer part function 
at the point x through W is 

{[x -k 0] — [x\] [x -k 5, 6] — [x]; [x -k 2, 45] — [xj; [x -k 3, 75] — [x]; [x — 1, 8] — [x]). 

The boundaries associated with the window W are the values boundary{w^) = 1; 
boundary{w^) = 0,4; boundary{w^) = 0,55; boundary{w'^) = 0,25; 
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boundary{w^) = 0,8. After being sorted, they partition the intervall [0, 1[ into 
regions (fig 1). 



boundary (w‘^) boundary(w^) boundary(w^) boundary(w^) boundaryiw^) 

[ ^ ^ ^ [ 

0 0.25 0.4 0.55 0.8 1 

Figure 1.— The partition of [0, 1[ into regions according to the 5 values 
boundary {w‘‘ ) . 

If for example code{x) belongs to the region between the boundaries 
boundary (w^) and boundary (w^) , then we have 

code{x) < boundary (w^) which implies, according to the jump-lemma, 

[a; -I- — [x] = = 0 

code{x) > boundary{w^) which implies [x + w'^] — [x] = -1-1 = 6 

code(x) > boundary(w^) which implies [x + — [x] = -1-1 = 3 

code{x) > boundary{w'^) which implies [x + — \x] = [w"*] -1-1=4 

and code{x) < boundary{uf’) which implies [x + w^] — [x] = [ru®] = —2. 

Thus the region lying between the boundaries boundary{w^) and boundary{w^) 
corresponds to the local configuration (0, 6, 3,4, —2). According to the jump- 
lemma, each region corresponds in the same way to a local configuration (fig 2) . 



boundary{w^) boundary{w‘^) boundary{w^) boundary{w^) boundary{w^) 

[ ^ ^ ^ ^ ^ ^ ^ ^ [ 

0 I 0.25 j 0.4 I 0.55 | 0.8 | 1 

(0,5, 2, 3. -2) (0,5, 2, 4, -2) (0,6, 2, 4, -2) (0,6, 3, 4, -2) (0,6, 3, 4,-1) 

Figure 2.— The regions and their corresponding local configurations 

The local configuration at a point x is the one which corresponds to the region 
of code{x). At X = 4,33 for example, code{x) = 0,33 and we conclude that at 
4, 33, the local configuration through W is (0, 5, 2,4, —2). 

From a mathematical point of view, since our window has size s, there are at 
most s different regions and then the number of different local configurations is 
bounded by s. It follows that the integer part function cannot have more than 
s different local configurations through a given window of size s. Now, we are 
going to use the same way of thinking to investigate the local configurations of 
digital hyperplanes. 

4 The local configurations of a digital functional 
hyper plane 

Let P be any functional in x„ digital hyperplane defined by the double-inequality 

(I) h < oixi -I- 02 X 2 H -I- OiXi + 1- a„x„ < h+\an\ 

where ft- G K., (oi, . . . , a^, . . . , a„) G R” with a„ different from 0 and where 
the variable x = (xi, . . . , x^, . . . , x„) belongs to Z". The coefficients o^ are not 
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supposed rational and it is also the definition used in [V99]. In [R91] on the other 
hand, the numbers ai are only taken in Q. The functional digital hyperplane P 
is the graph of a map p : Z”“^ — > Z. If the coefficient a„ is positive, p{x) = 
-[{Yh=i ~ h)/an]- If an is negative, p{x) = [{h - / °-n]- Then 

we have 

n— 1 

p{x) = e[(^ aiXi - h)/\an\] 

i=l 

where e = — I if a„ is positive and e = +I if a„ is negative. In order to investigate 
the local configurations of the function p, we need a lemma inspired by the 
investigation of the integer part function. 

4.1 Trampoline-lemma 

This term “trampoline” refers to the fact that this lemma is both a consequence 
and a multidimensional generalization of jump-lemma. Before writing it out, we 
will introduce the functions code(i){x) and boundary 

Definition 4.1 .— We call codc(i) the function 

code(i) : — > [0, |a„|[ 

n-1 

x= (xi,X 2 ,---x„_i) I — >code(i)(x) = aiXi - h) mod |a„| 

i=l 

and boundary (I) the function 

boundary (I) : — >]0, |an|] 



n-1 

w = (wi, W 2 , • • • w„_i) I — > boundary (j) (w) = |an| — aiW,) mod |an|) . 

i=l 

With these two functions, we can establish a lemma of jump similar to the 
jump- lemma except that the functions are no more [ ], code and boundary but 
p, codc(^j) and boundary 

Lemma 4.1 (trampoline-lemma)— Let x and w be two elements of , we 

have p(x -I- w) — p(x) = aiWi)/|an|] z/code(i)(x) < boundary q) (w) 

= e[(X)r=i^ aiWi)/|an|] + e otherwise. 

Proof. We can prove trampoline-lemma in a more general framework. If we 
examine the function p{x), it appears as the restriction to Z"~^ of a function 
q{x) of the form k[a.x — 1] where k and I are some real numbers, where a G 
and where the variable x is taken in . We have q{x -I- w) — q{x) = k{[{a.x — 
1) a.w] — [a.x — ?]) and according to the jump-lemma, this difference is equal to 
k[a.w] if {a.x — 1}<1 — {a.ic} and to k[a.w] k otherwise. Then we have here a 
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multi-dimensional generalization of jump-lemma. For the particular case which 
interests us, (a.x—l) mod 1 is ((X]r=/ ~^)/Wn\) mod 1 and 1 — (a.ru mod 1) is 

1 — ((X)r=i~i^ o.iWi /lunl) mod 1). Instead of working with these values, we multiply 
them by |a„| so that they would be integer values if (ai)i<i<„_i € Thus 

we obtain the functions code( 7 ), boundary and the lemma. 

Remark.— Trampoline-lemma is fundamental because the investigation of the 
differences p{x + w) — p{x) (where x and w are in Z"~^) which is a multidimen- 
sional combinatorial problem is reduced to unidimensional “reflections” . Thus 
the two functions code^j) and boundary(^j^ defined from a double-inequality (I) 
come to light as two powerful arithmetical tools for the study of the local prop- 
erties of digital hyperplanes. 



4.2 The local configurations of the function p 

Let W = be a window of Z"“^ and (c^)i<i<s the local 

configuration of the function p through IT at a point x. By definition, the value c* 
is p(x+w’’)—p(x) and we could compute it directly. According to the trampoline- 
lemma, c* is equal to e[(X]r=i^ o,iWi) /\an\\ if code(^p){x) < boundary and to 
o.iWi)/\an\] +e otherwise. It shows that the local configuration at point 
X only depends on the values e[(X]r=i /\an\\ and on the comparison of 
code(^i){x) with the s values boundary . These s numbers boundary 
called the boundaries can be sorted and this is an essential point. So ordered, 
they partition the inter vail [0, |a„|[ into s regions which are closed on the left 
and opened on the right side (when two boundaries are equal, the region that 
they surround is empty). To sum up, we can say that for a fixed function p 
and a fixed window IT, the local configuration of p through IT at a point x only 
depends on the region where there is codet^i){x) . Then each region corresponds to 
a local configuration and the local configuration at x is the local configuration 
associated with the region of code(j^{x)dNe will illustrate this analysis by an 
example. 

Example.— Let P be the digital plane functional in x^ and characterized by 
the double-inequality 

(I) —2 < 3xi -I- 4x2 + 7a;3 < 5 

The digital plane P is the graph of the function p : 1? — > Z defined by 
p(xi, X 2 ) = — [(3xi -|-4x2-I-2)/7]. Let IT be the window of I? of size s = 5 where 
= (0, 0), u? = (—2, —1), ufl = (1, 0), w'^ = (2, 0) and = (2, 1) (fig 3). 



Figure 3.— The window IT 
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The local configuration of the function p at a point a; G is the element of Z® 
defined by (p(x + w^) —p{x); p{x + w'^) —p{x); p{x + w^) —p{x); p{x + w‘^) —p{x); 
p{x + w^) — p{x)). The previous arithmetical tools are the functions 
code(^i) : 1? — > [0,7[ defined by code(^j^{xi,X 2 ) = (3xi + 4a;2 + 2) mod 7 
and boundarip^i) : 1? — ®’]0,7] defined by boundary (^j'^{wi,W 2 ) = 7— ((3wi + 
4 ^ 2 ) mod 7). The boundaries associated with the window W are the values 
boundary(^j){w^) = 7, boundary = 3, boundary = 4, 
boundary = 1 and boundary = 4. After being sorted, they parti- 
tion the interval [0,7[ into regions (fig 4). 



boundary {w^) boundary {w'^) boundary(w^)— boundary (w^) boundary (w^) 

E ^ ^ ^ [ 

0 13 4 7 

Figure 4.— The partition of [0, 7[ into regions according to the 5 values 
boundary 



If for example, code(^j){x) belongs to the region between boundary = 1 
and boundary = 3, then we have: 

code(^i){x) < boundary = 7 which implies according to the trampoline- 
lemma p{x + w^) - p{x) = e[(X;Li = -[(0.3 - 0.4)/7] = 0; 

code(^i){x) < boundary (^i){uP') which implies 

p{x + W^) - p{x) = c[(ELi a,.w^,)/\a,\] = -[(-2.3 - 1.4)/7] = 2; 
code(^i){x) < boundary which implies 

p{x + w^) - p{x) = e[(ELi = -[(1-3 + 0.4)/7] = 0; 

code(^i){x) > boundary which implies 

p{x + w‘^) - p{x) = e[(ELi ai-'^*^)/|a 3 |] + e = -[(2.3 -f 0.4)/7] - 1 = -1; 
code(^i){x) < boundary which implies 

p{x + w^) - p{x) = e[(X; Li o-^.w^,)/\az\] = -[(2.3 -h 1.4)/7] = -1. 

Then the region \boundaryi^i){w'^), boundary (^i'^{uP')[ corresponds to the local 
configuration drawn fig 5. 



Figure 5.— 





-1 


0 0 


-1 



0 

The local configuration associated to the region 
\boundary{w'^) , boundary {vrP')\ 



In the same way, the different regions correspond to the local configurations 
drawn fig 6. 
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Figure 6 The regions and their corresponding local configurations. 



The local configuration at x is the one which corresponds to the region of 
code(^i){x). At (xi,X 2 ) = (—3,5) for example, we have code(/)(— 3, 5) = 6 and it 
follows that at (—3, 5) the local configuration is (0, 1,-1, —1, —2) (fig 6). 

This analysis provides the arrangement of the local configurations in a digital 
hyperplane. In particular, we have partitioned the interval [0, |a„|[ into at most 
s disjoined regions. From this it follows that the function p cannot have more 
than s different local configurations through a given window of size s. It provides 
the following theorem: 

Theorem 4.1 .—A functional digital hyperplane has at most s different local 
configurations through a given window of size s. 

This theorem generalizes to all windows and all dimensions a result already 
known in dimension n = 3 for the rectangular windows [R95,V99]. Some other 
results concerning symmetric local configurations can be deduced from the anal- 
ysis we have developed in this part. 



5 Applications 

By using the functions code^i) and boundary we can prove the following 
theorem: 

Theorem 5.1 .— (direction-theorem)— Two digital hyperplanes P and P' of 

respective double-inequalities h < < h -|- |a„| and h' < ^ 

h' -I- |an| have the same set of local configurations through a given window. 

This theorem is trivial for the digital hyperplanes which have a rational normal 
direction because they are image one from the other by a translation. It holds 
in the general case for which such translations do not necessarily exist because 
the windows are by definition finite. If one considered infinite “windows”, the 
theorem would be false. The proof comes from a lemma of presence, which says 
that a local configuration is obtained if and only if it corresponds to a non-empty 
region. 

This theorem is useful because it allows the normal directions or in other words 
the normal cone of a given local configuration to be defined and the Fourier- 
Motzkin algorithm can be used in order to compute it [FST96] (for rectangular 
windows, we can also use [D95]). 
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The functions code/^i) and boundaryi^i^ can be used to obtain several results and 
we are going to illustrate their use by sketching a proof to the well-known fact 
that there exist exactly 40 tricubes (enumeration already obtained from several 
algorithms [D95,FST96,S97]). Let us remind the reader that the tricubes are 
the local configurations of the functions p{x,y) = —[{ax + by — h)/c\ with 
0 < a < b < c (functions from the digital planes functional in z of double- 
inequalities (I) h < ax + by + cz < h + c) through the square window = 
(0,0), = (-1,-1), = (0,-1), u;^ = (1,-1), = (-1,0), 

w® = (1, 0), = (—1, 1), w® = (0, 1), = (1, 1)} of sides 3 by 3. 

Firstly, instead of working directly with the whole domain 0 < a < b < c, we 
suppose 0<a<b<a + b<c. With a, b, c taken in this more restrictive 
domain, the values — [(a.w^ -I- b.Wy)/c] which appear in trampoline-lemma for 
p{x + w^) —p{x) are determined. Then we know that, p{x + w^) — p{x) is always 
null, the values p(x-|-w^)—p(x), p{x+w^)—p{x), p{x+w"^)—p{x), p{x+w^)—p{x) 
are equal to 1 or 0 and the values p{x + w®) — p{x), p{x + — p(x), 

p{x + ru®) — p{x), p{x + w^) — p{x) are equal to 0 or —1. We have also 
boundary (^i){w^) = c, boundary (^i){uP‘) = a + b, boundary (^j'j{w^) = b, 
boundary = b — a, boundary (^i-^{w^) = a, boundary ^i){w^) = c — a, 

boundary (^j'){w’^) = c + a — b, boundary(^i){w^) = c — b and boundary (^i){w'^) = 
c — a — b. Now, for a fixed value c, we draw the domain of (a, b) C verify- 
ing 0<a<6<a-|-6<c and the straight lines of equations boundary{w'^) = 
boundary{w^) (fig 7). On one side of such a straight line, we have boundary{w’‘) > 
boundary{w^) and on the other side, we have boundary{w^) < boundary{w^) . 




Figure 7 .— the domain Q<a<b<a + b<c cut into areas 



The areas of the domain 0<a<6<a-|-6<cso cut up correspond each one to 
an order of the boundaries. As we have seen that the values — [{a.wl + b.w\)/c\ 
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were constant in all the domain 0<a<b<a + b<c, each area and its 
corresponding order of the boundaries gives 9 local configurations (fig 8). 
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Figure 8 The orders of the values boundary in the areas 1, 2, 3 
and the associated tricubes. 



A presence-lemma ensures that any digital plane with its coefficients a, b, c in 
the interior of an area takes the 9 corresponding local configurations and one 
concludes that those are always 9 tricubes. Thus, this construction provides all 
the tricubes of the domain 0<a<6<a-|-6<c. Ifwe then study the domain 
0<a<&<c<a-|-6in the same way, we obtain the 40 tricubes shared out in 
the figure already presented in [CV97]. 



6 Conclusion 

In this paper, the main idea is to look at digital hyperplanes through windows 
of any shape. Instead of directly tackling the problem of the local configurations 
of the functional digital hyperplanes, we have investigated the local configura- 
tions of the integer part function. Their study does not require any other tools 
than a lemma (jump-lemma) and two functions {code and boundary). In order 
to tackle our initial problem, namely the local configurations of the functional 
digital hyperplanes, we can generalize the lemma (trampoline-lemma) and the 
two functions. In fact, in this multidimensional framework, the situation is ex- 
actly the same as for the integer part function except that the maps are no 
longer [ ], code and boundary but p{xi, X 2 ■ ■ ■ Xn) = e[(X]”=i^ — /i)/|a„|], 

codei^i) and boundary . These arithmetical tools allow us to compute the local 
configurations of a functional digital hyperplane and describe their deployment. 
They can be used to prove many theoretical results as we have seen with the 40 
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tricubes. Generally speaking one can say that they permit us to go easily from 

a double-inequality of a digital hyperplane to the local configurations. 
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Abstract. A digital naive plane can be represented by repetition of 
specific elements, called (n, m)-cubes, composed of nxm adjacent voxels. 
The aim of this paper is to study the class of (n, m)-cubes appearing in a 
plane in relation with the parametric representation based on the normal 
vector. Planes are ordered using Farey series coding and we prove the 
relationship between the segmentation issued from the Farey net and 
configurations of {n, m)-cubes. This is an original contribution. 



1 Introduction 

The topological development as well as the geometrical development of the dis- 
crete space (.^") theory is continually increasing. Algorithmic problems which 
are difficult to solve by the Euclidean geometry are easier to solve in the discrete 
context as it is the case for object representation or squelettization problems [3]. 
Moreover, the arithmetic definitions of discrete lines and planes [H][l] allow the 
entire discrete analysis of objects. 

The problem of recognizing digital lines [4] is now solved but the generalization 
to discrete planes [4] [5] is not optimal. It would be interesting to recognize digital 
planes by their geometry. The study of the coexistence of tricubes in a discrete 
plane has been studied by different authors either by Fourier algorithm [7] [13] or 
by a syntaxic analysis [14]. Nevertheless the characterization of planes by pieces 
with different sizes could reduce the set of corresponding planes. 

In this paper our interest is to generalize the representation of naive planes by 
(n, m)-cubes. For this, we will see the links between the parameters of naive 
planes and their position in Farey nets via a two-dimensional continued fraction 
algorithm [9] [2] [8]. We will illustrate the way how (n, m)-cubes generators of the 
naive plane can be identified to the parameters of the naive plane. Some results 
on 2D digital lines [6] [10] are here extended to 3D digital planes. 

2 Definitions 

A digital naive plane of normal vector (a, b, c) and translation parameter r is 
defined as the set of points M{x,y,z) G satisfying the double-inequality: 

Q < ax + by + cz + r < max(|a|, |6|, |c|) 



G. Bertrand, M. Couprie, L. Perroton (Eds.): DGCI’99, LNCS 1568, pp. 76—87, 1999. 
(c) Springer- Verlag Berlin Heidelberg 1999 
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where all parameters a, b, c, r are integers and a, b, c are not null all together 
and verify gcd(a, 6, c) = 1. We limit our study to naive planes in the 24^^ part 
of space such that 0<a<c, 0<6<c and c 0. These planes will be noted 
by V{a,b,c,r). 

We call remainder at point M{x, y, z) of the naive plane V{a, &, c, r) the value 
TZ{x, y) = (ax + by + r) mod (c). 

The lower leaning points (resp. upper leaning points) of the naive plane 
V(a, 6, c, r) are the points M(x, y, z) satisfying TZ(x, y) = 0 (resp. 7Z(x, y) = c— 1) 
(fig.l(a)). 

For n > 2 and m > 2, the (n,m)-cube at point (i,j) of the naive plane V 
is defined as the set C(i, j,n,m) = {(x,y, z) € V / i < x < i + n, j < y < j + m} 
(fig.l(b)). 




(a) (b) 

Fig. 1. (a) Part of the naive plane P{3, 7, 19, 0); in dark gray we have the upper leaning 
points and in light gray, the lower leaning points, (b) A (3, 4) -cube o/"P(3, 7, 19, 0). 

Previous works [12] have shown that a naive plane satisfies the two following 
properties: 

Property 1. A naive plane contains at most nm different configurations of (n, m)- 
cubes. 

Property 2. Each (2n — 1,2m — l)-cube centered on a leaning point (upper or 
lower) of a naive plane contains all the different configurations of (n, m)-cubes 
that can be encountered in this plane. 




Fig. 2. 3 X 3 and 5x5 blocs centered on a leaning point of the naive plane P(3, 7, 19, 0) 
with the configurations of (2, 2) -cubes (or bicubes) and (3, 3)-cubes (or tricubes). 
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Definition 1. 

- {n,m)-cubes at points (xi,yi) and {x 2 ,y 2 ) geometrically equal if and 
only if the difference TZ{xi + ki,yi + k 2 ) — 'R-{x 2 + k\,y 2 + ^ 2 ) is a constant 
for all (fci, ^ 2 ) G [ 07 ^— 1 ] X [O 7 TO — 1] . 

- {n,m)-cubes at points (xi,yi) and {x 2 ,y 2 ) are geometrically symetrics if 
and only if the sum TZ{x\ + fci, 7/1 + ^ 2 ) + Ti{x 2 + n — l — ki,y 2 + rn—l — k 2 ) 
is a constant for all (/ci, fe) G [0, n — 1] x [0, to — 1]. The symetry is defined 
with respect to the center of the volume delimited by the {n,m)-cube. 




a c 



Fig. 3. In the naive plane V{7, 13,23,0), the (2,4)-c«6e at point (2,0) (a) is geomet- 
rically equal to the (2,4)-c«6e at point (0,1) (b) and geometrically symetric to the 
{2,4)-cube at point (3,-1) (c). 

Then we verify the new following proposition (see figure 4 for illustration): 

Proposition 1. Let xi, yi be the abscissa and ordinate of a lower leaning point 
and Xu, yu those of an upper leaning point of a same naive plane V{a,b,c,r). 
Then, for all {a, (3) G the configuration of the (n, m)-cube at point of abscissa 
xi~\-a and ordinate yi + (3 is geometrically symetric, with respect to the center of 
the volume delimited by the {n,m)-cube, to the {n,m)-cube at point of abscissa 
Xu — n 1 — a and ordinate — to + 1 — /3. 

Proof. For all (a,f3) G we have the relation T^(a;/ + a,7/; + /3)+7^(a:„ — Q:,7/„ — 
/?) = c — 1. So according to definition 1, the configuration of (n, TO)-cube at point 
{xi a,yi (3) is geometrically symetric to the configuration of (n, TO)-cube at 
point {xu — n 1 — a,yu — rn 1 — (3). 




Fig. 4. Representation with the remainders of the two symetric {3,5)-cubes which are 
attached to point (2,2) and (7,5) of the naive plane "P(3, 7, 37, 0). The projection on 
the (Oxy) plane of the lower leaning point (voxel in light gray) is the point (3,4). The 
projection on the (Oxy) plane of the upper leaning point (voxel in dark gray) is the point 
(8,7). 
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With proposition 1 and property 2, we can limit our study to parts of plane 
that are centered on lower leaning points. We deduce by translation that the 
(2n — 1, 2m — l)-cube S of the naive plane V{a, b, c, 0) which is centered on the 
origin (a lower leaning point of this plane) contains all the configurations of 
(n,m)-cubes that are needed to generate the planes V{a,b,c,r). In fact we can 
remark that, for r = 0, • • • , c — 1, the (n, m)-cube C(0, 0, n, m) of the naive plane 
■p(a, b, c, r) is similar to one of the (n, m)-cubes contained in S. Consequently, we 
will looking for the different configurations which appear around leaning points 
of naive planes passing through the origin. First we need to recall some notions 
about discretization of real planes. 

3 Discretization of planes and hyper Farey nets 

The discretization of an oriented curve by object boundary quantization (OBQ) 
consists to take the nearest points of the discrete grid which are on the curve or 
on the right of the curve. 

The OBQ-discretization of a rational plane z = _fi^±hdi£ where 0 < a < c, 
0 < b < c and c yf 0 gives the discrete naive plane P(a, b, c, r). 

Let Vr be the real plane z = —{ax + (3y + j) where 0<a<l, 0</3<! and 
0 < 7 < 1. For n > 2 and m > 2, the OBQ-discretization of the plane Vr on the 
set V{i,j) = { (x, y) G ji<x<i + n, j < y < j + m} is the set Cr defined 
by: 

CR{i,j) = {{x,y,z) & I (x,y) G U(t, j), 0 < ax -k /3y -k z -k 7 < 1} 

As the set of rational numbers is dense in the set of real numbers, we always can 
find four integers a, b, c and r satisfying the following conditions: 

1) 0 < a < c, 0 < 6 < c, c yf 0, 

2) 0 < r < c, 

3) gcd(a, b, c) = 1, 

4) the discretization on V(i,j) of the plane z = —{ax + Py + j) is similar to the 

discretization of the rational plane z = Note that by definition 

the set of discrete points CR{i,j) is exactly the (n,m)-cube C{i, j,n,m) of 
the discrete naive plane V{a, b, c, r). 

Consequently, all rational numbers “, ^ and )) satisfying these conditions are 
“good” rational approximations of the real values a, P and 7. 

By taking the discretization of the real plane z = —{ax + Py) on the rectangle 
of size (2n — 1) x (2m — 1) centered on the point (0,0) , we obtain the discrete 
set S defined by: 

5 = { (x, y, z) G / — n < X < n, —m <y<m, z— 1< —{ax + Py) < .z} 

Using property 2, S contains all the different configurations of (n, m)-cubes 
appearing in the discretization of rational planes for which parameters are ap- 
proximations of a and p. 
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One can notice that there exists an infinite number of real planes which have 
the same discretization. So we define the equivalence class of S by: 

5 = {(a,/3) G [0, V\^ /y{x,y,z) e S z - 1 < -(ax + Py) < z} . 

There is a duality between the cartesian representation ax+Py+z = 0 of the real 
plane and its parametric representation by the point (a, P) . A line xa+yP+z = 0 
of the parametric space is the set of points (a, P) which represent the parame- 
ters of real planes passing through the point (x, y, z). The equivalence classes of 
the different configurations appearing around a leaning point are defined by sets 
of inequalities that decompose the unit square {(a,/?)/0 < a < 1, 0 < /3 < 1} of 
the parametric space into a set of polygons. This combination of lines is called 
an hyper Farey net associated with (n, m)-cubes.An illustration will be 
provided on figure 7. 

Since the cartesian representation of a discrete plane is given by a double in- 
equality, each discrete point (x,y,z), x = —(n — — 1, y = —(m — 

1), • • • , m — 1 and z = min(0, —(x + y), —x, —y), ■ ■ ■ , max(0, —(x + y), —x, —y), 
is associated with an half-open band B(x,y,z) in the parametric space de- 
limited by two parallel lines T>(x,y,z) and T>(x,y,z — 1), where T>(x,y,z) = 
{(a, P) G [0, Vp / xa + yP + z = . Each band B(x, y, z) represents the set of 

parameters (a,P) of real planes for which the discretization includes the point 
(x,y,z). 



Example 1. Let us analyze the (3x3) bloc S centered on the origin of a naive 
plane (illustration on figure 5(a)). Its equivalence class S is defined by the in- 
equalities — 1 < —a — /3 < 0, — 1 < —a < 0, 0 < —a -I- /3 < 1, — 1 < ~P < 0, 
0</3<l, -1 <«-/?< 0, 0 4a<l, 0<a-h/3<l. 

After reductions, the domain S illustrated on figure 5(b) is defined by the 4 
inequalities 0<o;-|-/3<l, 0<o;<l, 0</3<l, 0< —a + P <1. 





(a) (b) 

Fig. 5. (a) (3,3)-cube S centered on the origin; (b) domain S of parameters (a,P) 
related to the real plane ax -\- Py + z = Q having S as diseretization around the origin. 




(n, m)-Cubes and Farey Nets for Naive Planes Understanding 



81 



Figure 6 represents all the configurations of (3, 3)-cubes which can appear around 
the origin and the corresponding hyper Farey net. 




Fig. 6. (a) Black points are points appearing in naive planes "P(a,6, c, 0) in the 24“ 
part of space. For instance, the points numbered from 1 to 9 belong to the naive plane 
Q < y -\- z < 1. (b) Representation with the values xa + yfd of the (3, 3)-cubes centered 
on the origin, (c) Hyper Farey net attached to {2,2)-cubes. Each band B(x,y,z) = 
k)k£K,o<k<i'B{x,y,z — k) of the net is the parameter set of real planes Fr such that 
the point M(x,y,z) (black point of (a)) is in the discretization ofFn. 



Definition 2. A two-dimensional Farey series of order q € IN* is the set Fq of 
rational points defined by: 



'".“LI 



C=1 



a 

I G' 

c c , 



/0<a<c, 0<6<c>. 



The construction of Farey series of order q >2 is recalled here. 

Let Ai and A 2 be two points of the Farey series of order (q—l) 

satisfying Ci + C2 = q. The median point between Ai and A 2 , noted by Ai + A 2 , 
is the point with coordinates ^ ^ The Farey series of order q is 

obtained by adding to the Farey series of order (q — 1) all median points for 
which “denominator” are equal to q. 

Definition 3. Let A be a subset of the Farey serie Tq of order q > 1. 

An hyper Farey net attached to ^ zs o partition of the convex hull of A into 
triangles of vertices 



Ml 




M3 



f 03,1 03,2 
\a3.3 ’ ^3,3 



01,1 01^2 

Cl, 3 ’ Cl , 3 
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in A such as the determinant of the matrix (aij) zs ±1 and containing no points 
of A except for its vertices. 

Theorem 1. The intersection between a line of the hyper Farey net associated 
to {n,m)-cubes with another lines of the same Farey net is a subset of the two- 
dimensional Farey series of order q < 2{n — l)(m — 1). The two vertices of an 
edge of the Farey net are consecutive fractions of the one dimensional Farey 
series in abscissa and ordinate. 

Proof. The intersection of the line T){x, y, z) with the line ^{x' , y' , z'), for xy' — 
x'y yf 0 , is the point 

\yz' — y'z\ \x'z — xz'\\ 

\xy' - x'y\ ’ \xy' - x'y\) 

So the intersection of the line T>{x,y,z) with the lines V{x' ,y' , z'), x' = —{n — 
1 ), • • • , (n — 1 ) and y' = —{m — 1 ), • • • , (m — 1 ) is the set of rational points (^, 
satisfying 0 <o<c, 0 <&<c and 1 < c < 2 (n — l)(m — 1 ). 



Example 2. In the hyper Farey net associated 
to (2, 3)-cubes (fig. 7), the intersection of the 
line a — (3 = 0 with the lines a + 2/3 = 0, 

0 + 2/3= 1, o + /3 = 1, 0 + 2/3 = 2 and 
o + 2/3 = 3 generates the points (0, 0), ( 5 , 5 ), 

(|) I) and ( 1 , 1 ). The set of abscissa 
and the set of ordinates taken in ascending 
order are associated with the one dimensional Fig. 7. Hyper Farey net associated 
Farey series of order 3: {O, 1} {2,3)-cubes. 

We study now the relation between the position of the parameters (o, /3) in the 
Farey net and the different configurations of (n, m)-cubes of the naive plane 
7^(a, 6 , c, 0) where (f , j) is a “good” approximation of (o,/3) (as it was intro- 
duced at the begining of this section) . Let have a look at the different positions 
for (a,/3) with respect to the k-faces, k=0,l,2, of the lines arrangement of the 
hyper Farey net. 

Case 1: If the point (a, /3) is a vertex of the arrangement then it is a ratio- 
nal point expressed by (“, with c < 2(n — l)(m — 1). The discretization of 
the rational plane of parameters (a, /3) which includes the origin is the naive 
plane P(a, &, c, 0 ) containing strictly p (p < nm) configurations of (n, m)-cubes. 
Only the planes of normal vector (a, b, c) are generated by these p (n, m)-cubes. 
The point (a,/3) is also a vertex of the arrangement of the Farey net associated 
with (n^, m^)-cubes where n' > n and m! > m. The associated naive plane is 
generated by exactly p (n', m')-cubes. 

Case 2: If the point (a, /3) belongs to an edge of the arrangement then the me- 
dian point bi-^b 2 \ the two vertices — — 'l and f — — V is a 

^ C1-1-C2 ’ C1-1-C2 j y Cl ’ Cl y C 2 ’ C 2 / ’ 
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“good” rational point of minimal denominator approaching (a, P). The real plane 
of parameters (a,/3) and the rational plane of parameters ^ have 

the same discretization on the set {{x,y) G / — n < x < n, —m < y < m}. 
So they have the same configurations of (n, m)-cubes. If Li and L2 are the 
lists of (n, m)-cubes appearing in the planes of normal (oi, &i, Ci) and (02, 62, C2) 
then Li U L 2 is the list of (n, m)-cubes appearing in the naive plane of normal 
(oi + 02, 61 + &2, Cl + C2) with Card(Li U L 2 ) < nm. 

Case 3: If the point {a, P) is on a face of the arrangement, we have 3 or 4 
vertices bounding the face. The median point in case of 4 vertices corresponds 
to the median point issued from two opposite vertices (see case 2). In case of 

3 vertices, the median point of the 3 vertices is a “good” rational point (y, |) 
with minimal denominator approaching (a, P). The naive plane with parameters 
corresponding to the median point is composed by union of (n, m)-cubes issued 
from vertices. The plane is composed of exactly nm different configurations of 
(n, m)-cubes. All rational plane having their parameters in the face of the ar- 
rangement are discretized in naive planes generated by exactly nm (n, m)-cubes. 

4 Relation between Farey series and representation by 
(n, m)-cubes 

The different configurations of (n, m)-cubes appearing around leaning points are 
in one to one correspondance with vertices, edges and faces of the lines arrange- 
ment of the hyper Farey net. We are going to see through some examples how 
(n, m)-cubes can be constructed similarly with the computation of Farey series. 

Farey serie of order 1 and basic elements of planes 

The two-dimensional Farey serie of order 1 is the set of points (0, 0), (0, 1), (1, 0) 
and (1, 1) corresponding to the dual representation of the basic naive planes 
0<2;<1, 0<y-|-z<l, 0<x-|-z<l, 0<x-|-?/-|-z<lof the same 24^^^ 
part of space (see figure 8). We define a basic element as to be a set of three 
neighbour voxels not aligned. The basic planes are obtained by repetition of a 
basic element (figure 9). The configurations of basic elements are sufficient to 
generate naive planes. 



(b) 

Fig. 9. (a) Basic naive plane of normal 
Fig. 8. Hyper Farey net of order 1 and basie vector (0,1,1); (b) list of the 4 basic ele- 
elements generators of the basic planes. ments involved in this plane. 
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Hyper Farey net associated with bicubes 

On figure 10 we can see the basic planes generated by (2, 2)-cubes. On this 
example the new point of coordinates (^, is introduced by using the transition 
rules from the Farey serie of order 1 to the Farey serie of order 2. We introduce 
the median point between (f, f) and (x)x)or(Y,Y) and (x; x)- naive plane 
of normal vector (1, 1, 2) is generated by two (2, 2)-cubes. The representation in 
(2, 2)-cubes of each vertex of the hyper Farey net is obtained by concatenation 
of the basic elements. Concatenation is defined as following. 




Fig. 10. Hyper Farey net associated with {2,2)-cubes. 



The rules of construction of the (n, m)-cubes, for n > 2 and m > 2, are defined 
as follows: 

[S'] and [S'] will design two (ni, mi)-cubes. Their symetric will be noted [— S] 
and [—S'] (symetry with respect to the center of the volume including the (n, m)- 
cube). 

[S"] and [— S"] will design two symetric (ri 2 , m 2 )-cubes. 

[N] and [A^'] will design two neutral (ni , mi )-cubes (geometrically equal to their 
symetric) . 

[fV"] will design a neutral (ri 2 , m 2 )-cube. 

The operator of concatenation (noticed by “+”) between two (ni, mi)-cubes 
is defined as the union of them by a common band of voxels along lines or 
columns. 

The concatenation of two (ni, mi (-cubes generates a (n 2 , m 2 (-cube with the 
following constraint: 



rii = Ti 2 and mi < m 2 or ni < ri 2 and mi = m 2 

With respect to the classification of (n, m)-cubes in terms of symetric or neutral, 
we have: 
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- [5] + [iV] = [^"] 

- [ 5 ] + [ 5 '] = [ 5 "] 

- [5] + [-^] = [7V"] 

- [N] + [iV'] = [5"] 



The different examples will illustrate the different rules to construct the (n, m)- 
cubes appearing in a plane. To do that we will increase n and m values. 



Representation by (2, 3)-cubes 

In figure 11, the point C (|, |) is the median point (in the sense of Farey se- 
ries) between ( 5 , 5 ) and yl (j, j). The plane corresponding to B is generated 
by two bicubes and the plane corresponding to A is generated by one bicube. 
The concatenation of the two bicubes of B generates two neutral (2,3)-cubes 
but only one can coexist in a plane with the bicube of A. The association of 
the bicube of A with the two bicubes of B generates two symetric (2, 3)-cubes. 
Finaly, C is generated by three (2,3)-cubes. The point D is generated by 

four ( 2 , 3)-cubes issued from the concatenation of the three bicubes appearing 
in C (bicubes of A and B) with the bicube appearing in E (there is not neutrals 
in E). 




Fig. 11. Hyper Farey net associated to {2,3)~cubes. 



Representation by (3, 3)-cubes 

In figure 12, the point F’ Q, |) is the median point between C (|, ^) and yl (j, 2). 
It is composed of four (3, 3) -cubes issued from the concatenation of the neutral 
(2,3)-cube attached to A with the neutral and the two symetric (2,3)-cubes 
attached to C. 
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Fig. 12. Hyper Farey net associated to {3,3)-cubes. 



Representation by (3, 4)-cubes 

In figure 13, the point G (g, |) is the median point between F Q, |) and A (j, j). 
The plane associated with point F is generated by four (3,3)-cubes two by two 
symetrics. The concatenation of these (3, 3)-cubes generate four (3,4)-cubes, 
two symetrics and two neutrals with only one compatible with the (3,3)-cubes 
of A. So the plane associated with point G is composed of three among the four 
(3, 4)-cubes of F and of the (3, 4)-cubes generated by concatenation between the 
(3, 3)-cubes of F and the (3, 3)-cube de A. 





Fig. 13. Hyper Farey net associated with (3,4)-cti6es. 
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5 Conclusion 

A new approach has been discussed for digital naive plane understanding. Start- 
ing from the arithmetic formulation of a digital plane given by 0 < ax + by + 
cz + r < c, vie propose an identification process to list the set of bloc elements 
called (n, m)-cubes encountered on the plane. We proved that the incremental 
characterization of a Farey net (issued from the parametric representation (^, ^) 
of the plane) can be used to identify the list of different (n, m)-cubes. 

Future works will focuse on the definition of an algorithmic way to recognize 
digital planes. The method will be incremental on n and m values. It will pass 
alternatively from (n,m)-cubes structure to the corresponding Farey net. 
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Abstract. The goal of this paper is to generalize the notion of lighting 
function given in [3] in order to integrate strong 26-surfaces [5] into our 
framework for digital topology. In particular, the continuous analogue for 
strong 26-surfaces introduced in [10] is extended for arbitrary objects. 

Keywords: Lighting function, digital surface, strong 26-surface. 



1 Introduction 

In a series of papers [1,2,3] we introduced an approach to the notion of digital 
space within a new framework for digital topology. This framework is presented 
as a multilevel architecture which provides a link between a device level, where 
the discrete nature of digital objects is represented, and an Euclidean space. 
The most elaborate notion of digital space was given in [3] by introducing the 
notion of lighting function, which intends to formalize the idea of “continuous 
perception” that an observer may take on digital objects. In this way, a digital 
space is not only determined by a device model but also by a lighting function 
defined on it. In some sense, such a function is providing a method to construct a 
continuous analogue from each digital object, which is actually the “continuous 
perception” we are considering on that object. 

Based on the idea of “face membership rule” from Kovalevsky [9], lighting 
functions were originally defined through a set of four axioms, allowing a wide 
family of “continuous perceptions” used in literature. In fact, for all a,P € 
{6, 18,26} there exist lighting functions providing the (a, /3)-connectedness, de- 
fined on within the graph-based approach to digital topology [8] ; and, more- 
over, (a, /3)-surfaces [11,7] are also found as surfaces in the corresponding digital 
spaces, for {a, (3) yf (6,6). However these four axioms are not general enough to 
replicate the “continuous perception” associated with strong 26-surfaces in [5]. 

Our main goal in this paper is to introduce a set of axioms for lighting 
functions more general than that in [3, Def. 1]. These new axioms allow us 
to find a suitable digital space whose surfaces are exactly the set of strong 
26-surfaces (Th. 5). Several consequences are derived from this result. Firstly, 
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in [10] Malgouyres and Bertrand define a continuous analogue only for strong 
26-surfaces, which is now extended for arbitrary objects in our architecture. 
Moreover, results (e.g., the Digital Jordan-Brower Theorem) which have been 
proved in our framework with fully generality hold for strong 26-surfaces without 
further proof. Finally, our main result (Th. 5) provides a proof for possibly 
infinite strong 26-surfaces of the local characterization given in [10, Th. 6] only 
for finite strong 26-surfaces. 

These generalized lighting functions are introduced in Section 2, where we 
also recall some of the basic aspects of our framework, as well as some notations 
and results used in this paper. In Section 3 we give the appropriate digital space 
for which digital surfaces coincide with strong 26-surfaces. This is proved in 
Sections 4 and 5. 

2 The multilevel architecture 

As in [1,2,3] the first level of a digital space, called the device model, is used to 
represent the spatial layout of pixels, which are represented by the n-cells of a 
homogeneously n-dimensional locally finite polyhedral complex K . Namely, K is 
a complex of convex cells (polytopes) such that each cell is face of a finite number 
(non-zero) of n-cells. If 7 is a face of a we shall write 7 < cr, and 7 < cr if in 
addition 7 yf cr. If | AT | denotes the underlying polyhedron of K, a centroid-map 
is a map c : K ^ \ K \ such that c(cr) belongs to the interior (as a cell) of cr; 
that is, c(cr) G cr — da, where da = Ujy; 7 < cr} stands for the boundary of a. 
The set of all n-cells of K will be denoted by cell„(Ar). Given a device model K, 
a digital object in K is a, subset of the set cell„(A') of n-cells in K. 

In this paper we will deal with the device model i?", called the standard 
cubical decomposition of the n-dimensional Euclidean space K". This device 
model i?" is the complex determined by the collection of unit n-cubes in IR” 
whose edges are parallel to the coordinate axes and whose centers are in the 
set The centroid-map we will consider in i?" associates to each cube a its 
barycenter c(a). In particular, if dima = n then c(ct) G where dimer stands 
for the dimension of a. So that, every digital object O in i?" can be identified 
with a subset of points in Henceforth we shall use this identification without 
further comment. 

Now we are ready to introduce the notion of weak lighting function general- 
izing lighting functions in [3]. For this we need the following notation. 

Given a cell a G K and a digital object O C cell„(Ai), the star of a in O 
and the extended star of a in O are respectively the digital objects stn(a; O) = 
{a G O; a < a} and st* (a; O) = {a G O] a C\ a ^ Notice that if dim a = 0 
then stn{a;0) = st*(o;;0) for any digital object O in K. The support of O, 
supp(O), is the set of all cells a G K such that a = n{a;a G st„(a;0)}. To 
ease the writing, when the digital object is the whole set cell„(AT) we shall write 
supp(AT), stn{a;K) and st*(a;AT) instead of supp(cell„(AT)), st„(a; cell„(A')) 
and st* (a; cell„(A')), respectively. Finally, we shall write V{A) for the family of 
all subsets of a given set A. 
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Fig. l.The w.l.f. / in Example 1 does not satisfy property (F3). 



Definition 1. Given a device model K, a function f : V{celln{K)) x AT — > {0, 1} 
is said to be a weak lighting function (w.l.f.) on K if it verifies the following five 
properties for all O € V{celln{K)) and a € K. 

1. If a £ O then f{0,a) = 1. 

2. If a ^ supp{0) then f{0,a) = 0. 

3. f{0,a)<f{celln{K),a). 

4- f{0,a) = f{stl(a;0),a). 

5. Let O' C O and a G K such that stn{a;0) = stn{ce',0'), f{0',a) = 0 
and f{0,a) = 1. Then, the set of cells a{0';0) = {P < a;f{0',P) = 0, 
f{0,P) = 1} is not empty and connected in da. Moreover, if any digital 
object O is such that O C O, then f{0,P) = 1 for every /3 G a{0'; O). 

Properties (1), (2) and (3) already appeared in [3] as (F2), (FI) and (F4) 
respectively; however properties (4) and (5) replace to property 

(F3) f(0, a) = /(st„(a; 0),a) 

in [3]. As the equality st„(a;0) = st„(a; st* (a; O)) holds for every cell a G K 
and object O C cell„(AT), it is readily checked that property (F3) implies prop- 
erty (4). Moreover, if (F3) holds then no cell in K satisfies all hypothesis in (5). 
Hence (F3) also implies property (5). So that, w.l.f. ’s generalize lighting functions 
in [3]. Next example shows that the class of w.l.f. ’s strictly contains all lighting 
functions. 

Example 1. Let / be the w.l.f. defined on Bf by f{0,a) = 1 if and only if: 
(a) dim a = 2 and a G O; (b) dim a = 0 and a G supp(O); (c) dimer = 1 and 
one of the two following conditions holds: 

(cl) st 2 (o;; O) = st 2 (a; R^) 

(c2) a G supp(O) and there exist a,r G st^ia; R^) — 0 such that ernr = 0. 
In order to check that / does not satisfy property (F3), one observes that for the 
digital objects O and O' and the 1-cell a in Fig. 1 the equality st2(a;0) = 
st 2 (o;; 0 ') holds; however the definition of / yields that f{0,a) = 1 while 
/(O',a) = 0. 

As in [3], a digital space is defined as a pair {K, f), where AT is a device model 
and / is a weak lighting function on K. Now, given a digital object O in (AT, /) 
its levels are defined as follows. 

The device level of O is the pair (K{0), fo), where K{0) = {a G K;a < a, 
a G O} is the subcomplex of K induced by the cells in O, and fo is the restriction 
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of / to the set V{0) x K{0). In general, this pair is not a digital space because 

fo does not satisfy property (3) in Definition 1; however, fo turns to be a w.l.f. 

if it is redefined as fo{0',a) = f{0,a)f{0',a), for O' C O and a G K{0). 

The logical level of O is an undirected graph, Cq, whose vertices are the 

centroids of n-cells in O and two of them c(cr), c(r) are adjacent if there exists 

a common face a < cr n r such that f{0, a) = 1. 

£ 

The conceptual level of O is the digraph Cq whose vertices are the centroids 
c{a) of all cells a G K with f{0,a) = 1, and its directed edges are (c(a),c(/3)) 
with a < p. 

The simplicial analogue of O is the order complex Aq associated to the 
digraph Cq. That is, (xq, xi, , Xm) is an m-simplex of Aq if xq,xi, . . . , Xm is 
a directed path in Cq . This simplicial complex defines the simplicial level for the 
object O in the architecture and, finally, the continuous level is represented by 
the underlying polyhedron | Aq \ of Aq . This polyhedron is called the continuous 
analogue of O. 

For the sake of simplicity, we will usually drop “/” from the notation of the 
levels of an object. Moreover, for the whole object cell„(AT) we will simply write 
Ck, Ck and Ak for its levels. 

Next we recall the notion of connectedness given in [3] . Let O and O' be two 
disjoint digital objects in a digital space (K,f). Two distinct n-cells a,r G O 
are said to be O' -adjacent in O if there exists a common face a < a C\ t such 
that f{0' , a) = 0 and f{0 U O', a) = 1. An O' -path in O from a to t is & finite 
sequence C O such that CTo = cr, cr^ = r and Ui-i is O'-adjacent in O to 

(Ti, for i = 1, . . . , m. Then, a digital object O will be said O' -connected if for any 
pair of n-cells a,r G O there exists an O'-path in O from cr to t. And finally, an 
object C C O is an O' -component of O if for any pair a,r € C there exists an 
O'-path in O from cr to t and none element in C is O'-adjacent in O to some 
element of O — O. Observe that any O'-component is O'-connected itself. 

Given a digital object O in the digital space (AT, /) the previous definitions 
provide an entire family of notions of connectedness for O in relation to another 
object O' , when O' is allowed to range over the set of all subsets of cell„(A') — O. 
The extreme cases, when O' = 0 and O' = cell„(Ar) — O, represent the connected- 
ness of the digital object O itself and the connectedness of O as the complement 
of cell„(AT) — O, respectively. Theorem 2 shows how these notions of connected- 
ness are stated at each level of our architecture. Below, given two subcomplexes 
Li and L 2 of a simplicial complex L, the simplicial complement of L 2 in L\ will 
be denoted by Li \ L 2 = {a G Li; a n | L 2 | = 01- 

Theorem 2. Let O and O' he two disjoint digital objects in a digital space. The 
family T of O' -components of O can be described in any of the following ways 

1. Conceptual level: T = {Oq}, where Oq = {cr G O; c(ct) is a vertex of G}, 
and G ranges over the family of components of the digraph CqvjO' \ Co' ■ 

2. Simplicial level: T = |0^}, where Oa = {cr G O; c(a) G A}, and A ranges 
over the family of components of the simplicial complement AouO' \ Ao' ■ 

3. Continuous level: T = {Ox}, where Ox = (cr G O; c(ct) G X}, and X ranges 
over the family of components of the space \ AquO' I ~ I Aq' |- 
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The proof of this result follows the same sketch given in [3] , using next Lemma 
to prove part (1). 

Lemma 3. Let O, O' he two disjoint objects in K and a € K any cell such that 
f{OUO',a) = 1, /(O', a) = 0 and st„(a;OUO') = st„(o;;0'). Then there exists 
(3 < a such that f{0 U O', /3) = 1, /(O', /3) = 0 and stn{P', O U O') ^ stn{P', O'). 
Moreover, if j3' < a also satisfies the above properties then there exists a path 
{c( 7 i)}?™g in Couo' \ Co' from c{j3) to c{/3') such that 72 j-i € O, 1 < j < m. 

We finish this Section giving an intuitive motivation about the replacement 
of property (F3) in the definition of lighting functions by properties (4) and 
(5) given for w.l.f.’s. Properties (F3) and (4) state that whether a cell a is 
lighted for a given object O depends on a ‘digital neighbourhood’ N{a; O) of 
a in O. In property (F3) N{a]0) is chosen to be st„(a;0); i.e., the smallest 
neighbourhood of a in O. However the continuous analogue | I ii^^ds 

not to be, in general, a ‘continuous neighbourhood’ of the centroid c{a) in | Ao |- 
Due to this, N{a] O) is required to be st* (a; O) in property (4). With this choice 
the centroids of two isolated n-cells (Ti, (T2 G cell„(AT) — O, crincr2 = 0 , can belong 
to the same component of the continuous space | Ak \ — \ Ao | even though cti 
and CT2 are each one completely surrounded by n-cells in O. Thus, our continuous 
analogue would not provide the right representation of the continuous perception 
we take on digital objects. So, we require in addition axiom (5) to prevent this 
undesirable property. 

3 Main results 

In the multilevel architecture given in Section 2 the continuous analogue provides 
the “continuous perception” of digital objects. Thus, the following definition 
arises naturally. A digital object S' in a digital space (K, f) is said to be a digital 
manifold if its continuous analogue | | is a combinatorial manifold without 

boundary. In case S is a digital surface (2-manifold) in {R^, /) we say that S is 
an f -surface. 

In this section we state our main result. Namely, we find a digital space for 
which an object is a digital surface if and only if it is a strong 26-surface in the 
sense of Bertrand and Malgouyres [5] . This digital space is defined on the device 
model R^, the standard cubical decomposition of the Euclidean 3-space by 
the w.l.f. f^^, which is given by f^^{0,a) = 1 if and only if: (a) dima = 3 
and a G O; (b) dima = 0,2 and a G supp(O); (c) dima = 1 and one of the 
following conditions holds: 

(cl) sts(a; R^) C O 

(c2) a G supp(O) and st 3 (a; O) = sta(a; O) 

(c3) a G supp(O) and there exist cr, r G stg(a; O) with a D t = 0. 

Remark j. According to [3, Def. 11], it is not difficult to prove that the w.l.f. 
fBM pj.Qvi(;jes tjge (26, 6)-connectedness. That is, given a digital object O in 
(i?^, f^^), the components of O are exactly the 26-components of O; and more- 
over, O' is a 6-component of cell 3 (i?^) — O if and only if it is an O-component. 

Our main result is now stated as follows. 
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Theorem 5. A connected object S in the digital space is an f^^~ 

surface if and only if S is a strong 26-surface. 

In [10] Malgouyres and Bertrand characterize strong 26-surfaces within the 
family of strongly separating objects. We shall use this characterization in the 
proof of our Theorem 5 rather than the original definition of strong 26-surfaces, 
which is given in terms of strong homotopy [5]. For this we need some more 
notation from [10]. Here we use the identification of the grid C IR^ with the 
set cells (i?^) of 3-cells in R^. 

Given a digital object O in R^ let Nk{(r; O) denote the set of 3-cells t G O 
(cr ^ t) which are k-adjacent to a {k = 6, 18, 26), and let Ge{a; O) be the set of 
3-cells r G N 2 q{<j\ O) such that there exists a 6-path in st^cr; O) from cr to r of 
length less than or equal to 2. If Nk{<7; O) vie say that a is k-adjacent to O. 
Observe that N2q{(t;0) = st3(cr;0) — {a}. Finally, we recall that an object O 
is said to be strongly separating if — O has two 6-components and moreover 
each a G O \s 6-adjacent to both components (see [5]). 

Definition 6. ([10, Defs. 6 and 7[) Let S be a 26-connected object in . Then 
S is said to be a near strong 26-surface if the following four properties hold for 
all a G S. 

1. N 2 e{<T\Z^ — S) has exactly two 6-components Af and A 2 which are 6- 
adjacent to a. 

2. Ge{a; — S) has exactly two 6-components. 

3. For each r G N 2 e{cr;S) the sets N 2 e{T] Af) and N 2 c,{t-, Af) are non-empty. 

4. For each r G Ng(a; S) the sets Ge{a;Af U {r}) and Ge{a;A 2 U {r}) are 
6-connected. 

Then, strong 26-surfaces are characterized as follows. 

Theorem 7. ([10, Th. 4]) Let S be a 26-connected strongly separating object in 
2Z^ . Then S is a strong 26-surface if and only if it is a near strong 26-surface. 

Moreover, in [10] it is also proved 

Proposition 8. ([10, Pr. 1]) Any strong 26-surface is strongly separating. 

In the rest of the paper we will derive Theorem 5 from Theorem 7 and 
Proposition 8 as follows. In Section 4 we prove that any near strong 26-surface is 
an /^^-surface; and then, in Section 5, we show that any /^-^-surface is both 
a near strong 26-surface and strongly separating. 

It is worth pointing out that, although the notion of near strong 26-surface is 
local, the characterization given in Theorem 7 is not completely local because the 
notion of strongly separating is a global one. To obtain a local characterization of 
strong 26-surfaces it remains to prove that any near strong 26-surface is strongly 
separating. This is done in [10, Th. 6] for the class of finite strong 26-surfaces. 
As a consequence of our main result (Th. 5) this local characterization is now 
extended to possibly infinite strong 26-surfaces. Namely, 

Theorem 9. (Extension of Malgouyres-Bertrand Theorem) Let S be a 

26-connected object in . Then S is a strong 26-surface if and only if it is a 
near strong 26-surface. 




A Digital Lighting Function for Strong 26-Surfaces 



97 
















72 


1 


r 

j 


k 




4 









(a) (b) 

Fig. 2. A non-simple maximal elementary cube and its canonical cycle. 



4 Near strong 26-surfaces are /^^-surfaces 

In [10], Malgouyres and Bertrand define a continuous analogue Us for any near 
strong 26-surface S which is, in fact, a triangulation of a surface embedded in 
IR^. Hence, to prove that S is an /^'“-surface it will be enough to show that 
the simplicial analogue of S' is a subdivision of Ss- For this we recall more 
notations and results from [10] in the following paragraphs. 

An elementary cube is a closed unit cube with vertices in . Notice that the 
vertices of an elementary cube C are the centroids of eight 3-cells in which 
share a common 0-cell, and this 0-cell is the center of C. Given a digital object 
O, we say that an elementary cube C is maximal with respect to O if C n O 0 
and C n O = C" n O whenever C n O C C" n O for some other elementary cube 
C' . Finally, a maximal elementary cube C is said to be simple if for any 3-cell 
a & C C\ O the difference C n 2Z^ — O consists of 3-cells in 6-components of 
N 2 e{(T; TZ,^ — O) which are 6-adjacent to cr; otherwise C is said non-simple. 

Given a maximal elementary cube C with respect to a near strong 26-surface 
S, the vertices of C n S' can be arranged in a canonical cycle. If C is simple such 
a cycle is defined in [10, Lemma 5] as the subgraph induced by the vertices of 
C n S in the Id-adjacency graph of S; two 3-cell cr, t G S are said to be Id- 
adjacent if either they are 6-adjacent or they are 18-adjacent and no 3-cell in S 
is 6-adjacent to both cr and r. Now assume C is non-simple. Lemma 3 in [10] 
ensures that, up to symmetries or rotations, the only possible configuration of 
C n S is that in Fig. 2(a), where cr G S and t G G n — S is the only 3-cell 
in a 6-component of N2c,{cr,2Z^ — S) other than and in Definition 6. 
Moreover, both components Af, Af have representatives in the set of 3-cells 
{C n — (S' U {r}). Without lost of generality, assume that ti,T 2 are in AJ 
(see Fig. 2(a)). Then, the proofs of Lemmas 6 and 7 in [10] actually show that 
the eight cells depicted in Fig. 2(a) on the left and right hands of C are all in 
— S. Now, in [10, Lemma 8] a canonical cycle in G n S is defined as it is 
sketched in Fig. 2(b). 

By the use of canonical cycles, Malgouyres and Bertrand show in [10, Th. 5] 
that the simplicial complex Ss is a topological surface in Each triangle 
T G Ss has as vertices two successive points of the canonical cycle of G n S', 
where G is a maximal elementary cube with respect to S containing T; and its 
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third vertex is either the center of the common 2-face of C and C , if there exists 
a maximal elementary cube C ^ C such that C" n S' = C n S', or the center of 
C otherwise. 

In order to compare Ss with the simplicial analogue As we need the following 
lemmas. 

Lemma 10. Let O he a digital object in and C an elementary cube such that 
C C\ O ^ The following two properties are then equivalent. 

1. If C is another elementary cube such that C D O C C' D O , then C = C. 

2. If a G is the 0-cell in the center of C, then a G supp{C n O). 

In particular, if any of these conditions holds, the elementary cube C is maximal 
with respect to O and c(a) G Ao- 

The proof of Lemma 10 is immediate from the definitions of elementary cube 
and the support of an object. We use also the next lemma concerning centroids 
of 1-cells. 

Lemma 11. Let S be a near strong 26-surface and a an Tcell in R? . Then 
c{a) G Als if and only if the two elementary cubes C\,C 2 which meet in the 
2-face whose center is c(a) are both maximal with respect to S and, moreover, 
one of the two following conditions holds: 

1. Cl n s = C2 n s = 5^3(0; R?) 

2. st 3 {a; S) = {cr, r} and a,r are successive in the canonical cycles of both 
Cl n S and C2 H S. 

Proof. Assume c(a) G As. Then, the definition of f^^ yields that one of the 
following cases occurs: (a) st3(a;i?^) C S; (b) a G supp(S) and st3(a;S) = 
st3(o;;S) yf st3(o;;i?^); and, (c) a G supp(S) and there exist (Ji,cr2 G st3(a;S) 
with (Tl n (72 = 0. 

In case (a) both elementary cubes Ci and C2 are maximal and Ci n S = 
C2 n S = st3(o;;i?^). Otherwise only one of such cubes is maximal and simple 
with no canonical cycle in it. Here we use Lemma 3 in [10]. 

Case (b) cannot occur for a near strong 26-surface S' since it implies that Ci 
and C2 are maximal and simple, by [10, Lemma 3], with no canonical cycle in 
them. 

Finally, if case (c) occurs then both Ci and C2 are maximal, by Lemma 10. 
If, in addition, st3(a; S) contains only two 3-cells, these are Id-adjacent and so 
they are successive vertices in the canonical cycles of Ci C S and C2 C S (being 
Cl and C2 simple or not). 

We finish the proof by showing that st3(o;; S) does not contain three 3-cells 
(four 3-cells is case (a)). Otherwise we have 3-cells cr, r, p G S and S ^ S 
as in Fig. 3. By property (3) in Definition 6, p must be 26-adjacent to both 
6-components Af and A 2 which are 6-adjacent to a. From this one readily de- 
rives that X G S or y,y' G S. As both Ci and C2 are maximal and simple, S 
belongs to either A{ or A 2 , and hence p ^ S; here we use Lemma 3 in [10]. 
Now, we claim that pi,p 2 ^ S. Otherwise, if /ii G S' one applies property (3) in 
Definition 6 to pi and p and gets that p 2 € S. Hence, 5 ^ A\[J AI 2 , which is a 




A Digital Lighting Function for Strong 26-Surfaces 



99 




Fig. 3. The center of a 2-face of an elementary cube which has exactly three of its 
vertices in a near strong 26-surface S is not lighted for S. 



contradiction; and similarly for /i2 € S. Finally, tt G S' by hypothesis and there 
is no canonical cycle in C 2 H S. 

Conversely, it is straightforwardly proved that condition ( 1) in the hypothesis 
implies case (a), and condition (2) together with the maximality of Ci and C 2 
yields case (c). 

We are now ready to prove 

Proposition 12. Let S be a near strong 26-surface in . Then, the simpli- 
cial analogue As of S in (R^,f^^) is a subdivision of Es- So that, S is an 
f^^ -surface. 

Proof. To prove this result it is enough to show that: (1) Any triangle of Es is 
the union of two triangles of .4^; and (2) any simplex in .4s is contained in some 
triangle of Es. 

First we prove (1). By definition of Es, its triangles can be drawn inside 
an elementary cube in exactly three ways (up to rotations or symmetries) as 
sketched in Fig. 4. In this figure, black dots are points in S, while circles are 
points not in S. Moreover, in Fig. 4(a) the only vertices of the elementary cubes 
belonging to S are exactly the four black dots, but in Fig. 4(b) and (c) there 
must exist at least another vertex in S in addition to the two black dots. Now, 
it readily follows from Lemmas 10 and 11 that the triangles in Fig. 5 are in As 
and subdivide the corresponding triangles of Es. 

To prove (2) we first observe, from Lemmas 10, 11 and the definition of f^^ , 
that all the vertices of As belong to | 27s |. We next prove that any triangle 
a G Ms with vertices in centroids of cells oi < «2 < as, is contained in | Es \. 

If dim 03 = 2, and thus dimai = 0 and dim«2 = 1 then, Lemmas 10, 11 
and the definition of f^^ yield that the elementary cube with center c(«i) is 
maximal and simple with no canonical cycle in it; here we also use Lemma 3 
in [10]. Thus, this case cannot occur, and hence there are no tetrahedrons in 
As. Therefore dim 03 = 3, and a lie inside an elementary cube (up to rotations 
or symmetries) in the three patterns depicted in Fig. 5. Moreover it is readily 
checked from Lemmas 10, 11 and the definition of f^^ that the corresponding 
triangles of Fig. 4 are in Es. Hence (2) holds for triangles. Similarly for edges. 
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Fig. 4. The triangles of Ss- 





5 /^^-surfaces are strong 26-surfaces 

In this section we prove that any given /-^^-surface S in is a strongly sepa- 
rating set, and moreover, the four properties in Definition 6 hold for all cr G S'. So 
that, by Theorem 7 , S is a strong 26 -surface. In this proof we use the following 
properties of . 

Remark 13 . Let O be a digital object in {R?,f^^) and let a < ct G O with 
fBM^O^a) = 0 . If dima = 2 then a = cr n r with r a 3 -cell in cell3(i?^) — O. 
Moreover if dima = 0 then the definition of yields that a ^ supp(O), and 
from Lemma 10 it follows that the elementary cube C with center in a = c(a) 
contains a 2 -face whose four vertices are (the centroids of) 3 -cells in cell3(i?^) — O. 
Finally, if dima = 1 then two cases are possible: 

Case a: a ^ supp(O). Then c(a) is the center of a 2 -face A common to two 
elementary cubes, and the vertices of an edge of A are (centroids of) two 3 -cells 
in cell3(i?3) - O. 

Case b: a G supp(O). If /3i,/?2 G are the two 0 -cells with /3i,/32 < oi, then 
the definition of implies that = 1 and = 0, where 

{bj} = {l,2}. 

Lemma 14 . Let O be a digital object in {R^, and a < a G O. //c(a) ^ Ao 
then st^{a; R^) — O and Gg (cr; st3(a; R^) — O) are respectively contained in a 
6 -component of stl{a; R ^)—0 and G6(cr; Z^—O), which are 6 -adjacent to a. Fur- 
thermore, Ge{a; sts{a; R^) — O) = % if and only if dima = 1 and 0(71), 0(72) G 
Ao, where 71,72 < cr are the two 2 -cells with a = 71 n 72. 

Proof. For st3(a;i?^) — O the lemma is immediate from Remark 13 . Also Re- 
mark 13 directly implies the result for Gg(cr; stg (a; i?^) — O) except in the case 
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dim a = 0, for which it suffices to check the essentially distinct six patterns of 

0 in st 3 (a; R^). Moreover, the characterization of G 6 (cr;st 3 (a; R^) — O) = 0 is 
also derived from Remark 13. 

Another important ingredient in the proofs below is the notion of relative 
ball in (IR”,M), where M C ]R" is a combinatorial (n — l)-manifold; see [2]. 
More explicitly, a relative ball in (IR", M) is a pair of balls (R”, such that 

R”-i C R", n aS” = and R” n M = R”"b 

Remark 15. Let / be a w.l.f. on R" with | Ann \ = ]R". Assume that M is a 
connected digital (n— l)-manifold without boundary. Then the classical Jordan- 
Brouwer separation theorem states that | | — | Am \ = IR" — I -^m \ has two 

connected components. Moreover, any relative ball (R",R"“^) in (IR", | Am |) 
verifies that R" — R"“^ has exactly two components, each of which is contained 
in a distinct component of IR” — | Am |; see [2]. Furthermore, the components 
of IR" — I Am I determine the M-components of cell„(R") — M according to 
Theorem 2. 

In particular, as f {cells (R^) , a) = 1 for all a & R?, the continuous ana- 

rB M Q „ , , 

logue I I is IR . Thus /-““-surfaces satisfy a digital Jordan property; that 

is, if S is an /““-surface then cells (R^) — S has two R-components, and, ac- 
cording to Remark 4, two 6-components. Moreover, if (R^,R^) is a relative ball 
in (IR , I Ag I) then the components of R"^ — R^ determine the 6-components 
of cells (R3) -R. 

For the following proposition recall that given a simplicial complex K the star 
and the link of a vertex v G K are the sets st(u; K) = {a G K; a < t G K,v G t} 
and lk(w; K) = {p G st(u; RT); v ^ p}, respectively. 

Proposition 16. Let f be a w.l.f. on R" such that f{M,a) = 1 whenever 
stn{a; R") is contained in the digital object M. If M is a digital {n— l)-manifold 
in {R"',f) then the pair (| st{c{a)\ Afc^) |, | st{c{a)]AM) I) is a relative ball in 
(IR", I Am I) for each cell a € R" with c{a) G Am- 

Proof. It is readily derived from the hypothesis that | Ar^ \ = IR”. Then, as 

1 Am I is a combinatorial (n — l)-manifold without boundary, st(c(a);MM) C 
st(c(a); Ann) are balls of the corresponding dimensions. In fact, st(c(a); Am) = 
st(c(a); P Am since Am C M/jn is a full subcomplex. The same argu- 
ment yields lk(c(a); Am) = lk(c(o;); n Am- Notice that lk(c(a); Mm) is the 
boundary of st(c(a); Am)- 

Remark 17. According to Remark 15, = \ st(c(a); | — | st(c(a);MM) | 

has two components. Moreover, it is easy to prove that for a component C there 
always exists a cell /3 e R” with c(/?) e C. For this one uses that Am is full in 
M_Rn. 

Proposition 18. Any connected digital f^^ -surface S in (R^, /““) is strongly 
separating. 
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Proof. Since S is an /^^-surface we know, by Remark 15, that — S = 
cell3(i?^) — S has two 6-components. So, it suffices to check that each cr G S' is 
6-adjacent to both of them. By Remark 15, one also knows that the 6-components 
of — S are determined by the (continuous) components Ci , C 2 of Dcr = 

I st(c(a); Alijs) | — | st(c((j); Alg) |; see Proposition 16. Then, the result follows 
from the existence of c(ai) G Ci, i = 1,2 (see Remark 17) and Lemma 14. 

Proposition 19. Any connected digital -surface S in (R^,f^^) is a near 
strong 26-surface 

Proof. It will suffice to prove that properties (l)-(4) in Definition 6 hold for each 
a G S. 

Property (1) will follow if we prove that N 26 {a; — S) = st3((r; R^) — S 

has at most two 6-components which are 6-adjacent to a, since it has at least 
two as S is strongly separating. And this follows by showing that if (Ti,CT 2 ^ S' 
are 6-adjacent to a, the existence of a path c(cr n cri) = 0(70 ), . . . , 0(7^) = 
c(cr n CT2) in the difference = \ st (c(cr) ; A/^a) | — | st(c((r); As) | implies that 
(Ti and (72 are in the same 6-component of N 2 e{fr; — S). Without lost of 
generality assume 7^ < 7i_i. Then st3(7i_i; i?^) — SC st3(7i;i?^) — S and 
moreover they are non-empty sets since f^^ satisfies f^^{0, a) = 1 whenever 
st3(o;;i?^) C O. Then Lemma 14 and an inductive argument on k show that 
{cti, CT 2} C uf,^gSt3(7i; R^) — S is contained in some 6-component of N 2 Q{a; — 
S) which is 6-adjacent to a. 

To prove property (2) it will suffice to check that G6(cr; — S) contains at 
most two 6-components, since it has at least two by property (1). This is proved 
following the same pattern as for the proof of property (1) above if we show that 
G6(st3(7G R^) — S) are non-empty sets, which is readily derived from Lemma 14. 

For the proof of property (4), let us consider the two 6-components Af and 
A 2 of N 2 q{(t;Z^ — S) which are 6-adjacent to a. Then, it is not difficult to 
prove that the two 6-components of G6(cr; 7Z^ — S) are Ge{a;Af), i = 1,2. So, 
property (4) will follow by proving that any r G N^^a; S) is 6-adjacent to both 
G6 (ct; Af) and CQ{a; AJ). For this we consider the 2-cell 7 = ct n t (notice that 
0(7) G As) then one observes that any centroid c{a) in the difference also 
belongs to Dg. . As the components of both and D^r determine the components 
of I Ar3 I — I As I then two cells oi, 02 <7 can be found with c(ai) G Ci, where 
Ci {i = 1,2) are the components of Do-. Notice also that the proof of property 
(1) above actually shows that Gi and G2 determine the two 6-components Af, 
A2 . In fact, for a cell a G R^, c{a) G Ci if and only if sts(a; R^) — SC Af. 
Now Remark 13 ensures that if dimoj = 0 or dima^ = 1 and Oj supp(S') then 
there exists a 3-cell in G6(cr; st3(aG R^) — S) which is 6-adjacent to t. Otherwise, 
if dim«i = 1 and G supp(S'), from Remark 13 there is a 0-cell (3 < a with 
fBM^S,f3^ _ Q applies the previous case to /3. 

Finally we prove property (3). From the definition of f^^ we first derive 
that for any r G N 2 e{(j; S) there exists < r n cr with c(ar) G dst{c{a); As). 
If dim a,- = 2 the proof above of property (4) yields the result. Otherwise, if 
dima,- < 2, one finds a centroid c(«i) in each component Ci C D^j {i = 1,2) 
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such that ai < or In any case all 3-cells in st 3 (o;i; R^) — S C 

are 26-adjacent to r. Here we use again the fact that f^^{S,a) = 1 whenever 
st„(o;; R^) C S. 
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Abstract. The purpose of this paper is to define the notion of “real” 
intersection between paths drawn on the 3d digital bonndary of a con- 
nected object. We consider two kinds of paths for different adjacencies, 
and define the algebraic number of oriented intersections between these 
two paths. We show that this intersection number is invariant under any 
homotopic transformation we apply on the two paths. Already, this in- 
tersection nnmber allows us to prove a Jordan curve theorem for some 
surfels curves which lie on a digital snrface, and appears as a good tool 
for proving theorems in digital topology about surfaces. 



Key words: Digital topology, digital surfaces, surfels curves. 



Introduction 

Digital surfaces of three dimensional objects have proved to be a fruitful model 
for visualisation and analysis of the objects they represent ([2]), especially in the 
biomedical field. Efficient algorithms for extracting surfaces from volumes, and 
computing shape characteristics exist ([4]). Sometimes, the surface itself needs 
to be segmented since some anatomic points are defined on it. Then we obtain 
some subset X of the set of the surfels of the surface. In [5], several tools of 
image analysis are defined in this nonplanar framework to analyse the topology 
of the subset X. 

In particular, in [5], two complementary adjacency relations between surfels 
on a digital surface are introduced, the e— adjacency and the w— adjacency, and a 
notion of topology preservation is proposed. Given Y C X two subsets of a digital 
surface, a relationship is established, between the property of X to be reducible 
to Y by sequential deletion of simple surfels, and the fundamental groups ([3]) 
of Y and X . However, the conditions obtained in [5] seem redundant, and the 
authors conjecture that a stronger theorem exists. Such powerful results are 
now difficult to prove because of the lack of tools for proving theorems about 
homotopy of closed paths. 

On the other hand, topological properties of curves, well known in the 2D 
planar discrete case, have not yet been studied for curves on digital surfaces. 
However, the nonplanar framework imposes us to take into account the notion 
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of homotopy properties of the considered curves. Again, the need for the devel- 
opment of a tool for proving theorems about homotopy of curves or closed paths 
appears. 

The purpose of this paper is to propose such a tool. Given two paths c and tt 
on a digital surface, we define a nontrivial integer invariant called the inter- 
section number ofn and c, an e— path and a u— path, which represents intuitively 
the algebraic number of transverse intersections between c and tt, summing alge- 
braically the intersections with a sign depending on their orientations. We prove 
that the number X,r,c is invariant when c or tt ranges within a homotopy class of 
paths. In other words, the number which can be computed with the data 
of c and tt, is in fact a function of the homotopy classes of c and tt. 

This invariant can for instance be used as follows: given a closed path tt, we 
can prove that tt is not homotopic to a trivial path by exhibiting another path c 
such that ^ 0. We illustrate this principle by proving a Jordan property for 
those of the simple closed curves which are homotopic, as closed paths, to a trivial 
path. The intersection number appears as a good tool for proving theorems in 
digital topology, and we expect to use it in the future to prove more results, in 
particular concerning topology preservation within surfaces. 



1 Basic notions and definitions 

To define the notion of a digital surface, we must recall few notions of digital 
topology. First, we consider objects as subsets of the 3 dimensional space T? . 
Elements of 2? are called voxels (short for “volume elements”). Voxels which 
do not belong to an object O C constitute the complement of the object 
and is denoted by O. Any voxel can be seen as a unit cube centered on a point 
with integer coordinates : v = (i,j,k) € Z^. Now, we can define some binary 
symetric antireflexive relations between voxels. Two voxels are said 6— adjacent 
if they share a face, 18— adjacent if they share an edge and 26— adjacent if they 
share a vertex. For topological considerations, we must always use two different 
adjacency relations for an object and its complement. We sum this up by the 
use of a couple (n, n) with {n, n} = {6, 18}, the n— adjacency being used for the 
object and the n— adjacency for its complement. By transitive closure of these 
adjacency relations, we can define another one : connectivity between voxels. We 
define an n-path tt with a length k from a voxel a to a voxel & in O C Z^ as a 
sequence of voxels (ui)j=o...fe such that \/0 < i < k, Vi is n-adjacent or equal to 
Vi+i, Vo = a and Vk = b. Connectivity now can be defined; two voxels a and b 
are called n-connected in an object O if there exists an n-path tt from a to 5 in 
O. This is an equivalence relation on voxels of an object O, and the n— connected 
components of the object O are equivalence classes of points according to this 
relation. Using this equivalence relation on the complement of an object we can 
define a background component of O as a connected component of O. 
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1.1 Digital surface 

In this paper, we are interested by surfaces constituted by the boundary between 
a 6— connected or 18— connected subset O of 7? and V, one of its background 
components. As in [7] we first define the border between O and V by: 

6{0,V) = {{a,b) I a is 6— adjacent to b, a G O, b G V} 

The set U = 6{0,V) is called a digital surface and has the Jordan property 
(according to the definition given in [7]). 

Each couple (a,b) of S is called a surfel (short for surface element) and 
can be seen as the common face shared by two 6-adjacent voxels, the first one 
belonging to the object, the second one to the background. Note that such a 
face is oriented according to the outward normal and this definition of a surfel is 
more restrictive than the classic one. In fact, we call a voxel face the unit square 
shared by any two 6-adjacent voxels, but a surfel is the oriented common face 
of two 6-adjacent voxels, where the first on is a voxel of O and the second one a 
voxel of V. 

In the sequel of this paper, S = S{0, V) is a digital surface. 



1.2 Surfels Neighborhood 

A surfel in a digital surface shares a given edge with at most three other ones. 
Depending on the adjacency considered for the object (6 or 18), we can define 
an adjacency relation between surfels in such a way that a surfel has exactly four 
neighbors, one per edge (whereas at most 3 other surfels can share a given edge) . 
The definition of this classical regular graph on E can be found for instance in 
[6]. This adjacency is called e— adjacency (short for “edge adjacency”). As in [5] 
we define a loop as an e— connected component of the set of the surfels which 
share a given vertex (see Figure 1). One can see that a vertex is not sufficient 
to uniquely define a loop since a vertex can define two distinct loops. In fact, a 
loop is well defined given a vertex and a surfel incident to this vertex. 

We say that two surfels are u— adjacent (short for “vertex adjacent”) if they 
belong to a common loop. We denote by Nn{x) for n € {e, v} the n-neighborhood 
of the surfel x, i.e. the set composed of the surfels of E which are n-adjacent to 

X. 

In the case when O is considered as 18— connected, we avoid some special 
configurations by the assumption that any loop of the surface is a topological 
disk. A formal way to express this assumption is to say that two u— adjacent 
surfels which are not e— adjacent cannot both belong to two distinct loops. An 
equivalent formulation can be stated as follows: we assume that if the object O 
the surface of which we consider is studied with 18— connectivity, and if there 
exists in O two 18— adjacent voxels which are not 6-adjacent (see Figure 2) 
then, at least one of the two following properties is satisfied: 

— The two voxels have an 18— neighbor in O in common. 

— The voxels have two 26— neighbors in O in common which are themselves 
26— adjacent. 
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Fig. 2. A pathological case for which a 
Fig. 1. Example of a loop. loop is not a topological disk. 



This restriction is necessary and sufficient to ensure that a loop is a topological 
disk. However, given a surfel x G S, we will also need to define a topology on 
the neighborhood Ny{x) under which it is a topological disk too (see [5]). For 
this purpose, we define the Ca,— adjacency relation: 

Definition 1 Let x he a surfel of a digital surface E. We say that two surfels y 
and y' of Ny{x)U{x} are e^— adjacent if they are e— adjacent and both contained 
in a loop which contain x. 

This relation allows us to deal with the e^;— connected component of Ny{x) n X 
where AT is a set of surfels. 

1.3 Surfels paths 

Let n € {e, u}. 

Definition 2 An n-path c with a length I in a digital surface E is a sequence 
{sk)o<k<i of 1+1 surfels for which Sk is n- adjacent or equal to Sk+i for 0 <k <1. 
A closed n-path is a path such that si = Sg. 



Notation 1 For a sequence s = {si)o<i<n of surfels, we denote s* = {si| 0 < 
i < n}. 

Definition 3 ( Concatenation of paths) If a = (oq, . . . , a„) and b = {bo, ■ ■ ■ , b„) 
are two n— paths such that an is n— adjacent or equal to ho then we denote a.b = 
(ao , ... ,an,bo, ■ ■ ■ , bm), which is called the concatenation of the two paths a and 

b. 

1.4 Homotopy of paths, Fundamental Group 

The fundamental group in digital topology, as defined by Kong in [3], has shown 
to be a very useful tool for studying topology of digital sets. Here, we define an 
analogue to the fundamental group for digital surfaces. First, we must define a re- 
lation of elementary deformation between paths, the elementary Tt-deformation 
relation. 
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Definition 4 (elementary H— deformation and homotopy of paths) Let X C Id- 
Let c = {xi)i=Q,... ,q o-nd c' = (x')i=o,... ,q' two v—paths in X. The path c is 
said to be an elementary TL- deformation of c' in X if c = 7 ri.a. 7 T 2 , d = -kx-o! . 1^2- 
Where a and a' have the same extremities and are both contained in a common 
loop of the surface E (note that the path tti or tt 2 can be empty). 

We define the n-homotopy relation between paths as the transitive closure of the 
elementary Tl- deformation: two n-paths c and c' are called n-homotopic (which 
we denote by c c' ) if there exists a sequence of H- deformations which allows 

us to obtain one from the other. 



Remark 1 If a path c is not closed, then any path c' which is n— homotopic to 
c must have the same extremities as c. 

Now, the set of equivalence classes of closed n— paths from a surfel b (called 
the base point) to b in E, given an operation on classes induced by concatenation 
of paths, has a group structure. It is called the fundamental group of E with base 
point b and denoted by IIn{b, E). This (generally non-abelian) group structure is 
very useful for studying the topology of objects in the digital spaces and 7? or 
surfaces. It is used for example for the characterization of topology preservation 
([l]and[5]). 

This first and simple definition of elementary deformation of paths will be 
too “large” in our context and proofs in the sequel will need to consider a more 
restrictive transformation for u— paths with simpler conditions, which leads to 
the same notion of homotopy. 

Definition 5 Let X C E, c = (xi)i=o,... ,g (ind c' = (a;()i=o,... ,<?' be two v—paths 
in X. Then, c is said to be an elementary C- deformation of c' in X if c = 
7Ti.(si, S2, S3).7T2 and c' = 7Ti.(si, S2).7T2. Where the three surfels si, S2 and S3 
belong to a common loop of E. We define the C-deformation as the symmetric 
and transitive closure of the elementary C— deformation. 

In other words, the elementary relation links two w-paths which are almost 
the same except that one is obtained by deletion in the other of a surfel which 
belongs to the same loop as its predecessor and successor. Now, we can state the 
following property : 

Proposition 1 Two v—paths c and d are v— homotopic in a surface X C E if 
and only if one is an C— deformation of each other in X. 

2 The intersection number : Definition 

In the following, paths all lie on a digital surface E. If a path is closed, the 
subscripts of surfels must be read modulo I where I is the length of the path. 

Notation 2 (Vertices and oriented edges) Since a surfel has four vertices, we 
can order these vertices as in [6] by distinguishing one vertex for each type 
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of surfel (we distinguish 6 types of surfels according to the outward normal 
vector) and impose a turning order for vertices around the outward normal to 
the surfel. Each vertex of a given surfel is associated with a number in {0, 1, 2, 3}. 
With this parameterization of vertices we can define oriented edges as couples of 
consecutive vertices according to the cyclic order. So, for each surfel, we have the 
four following oriented edges : (0, 1), (1, 2), (2, 3) and (3, 0). If {a, b} C {0, 1, 2, 3}, 
we denote as an interval [a, b] the set of vertices met when looking after vertices 
from a to 6 in the cyclic order defined before. So, we can write c G [a, b] if 
the vertex c is met when looking for vertices from a to b. For an e— path tt = 
{yk)k=o,... ,p and for k G {0,... ,p}, we defme front ^{k) (resp. back^{k)) as the 
oriented edge (a, b) with a,b G {0, 1, 2, 3} of the surfel yk shared as an edge by 
yk and yk+i (resp. yk and yk-i)- Remark that &ocfc.„.(0) and front^{p) are not 
defined if tt is not closed. 



Definition 6 Let tt = (yk)k=o,... ,p be an n—path. We say that tt has a local back 
and forth at the surfel yk if yk-i = J/fc+i- 



Remark 2 Suppose that tt has no local back and forth. Then, if (a, b) = 
backTr(k) and (c, d) = front^{k) then a ^ c and b ^ d. Indeed, since the edges 
are oriented, a = c implies that b = d; this would mean that the front and back 
edges of yk are equal, so that we would have yk-i = yk+i- 

We want to define locally, at each point of an e— path the right side and 
the left side on the surface, taking into account the orientation of the surface 
(Definition 8 below). 

Definition 7 Let tt = {yk)k=o,... ,p be an e—path. For k G {I, . . . ,p — 1} (k G 
{0, . . . ,p} if TT is closed), we define the sets L^{k) and R^i^k) both included in 
{0,1, 2, 3}. Let {a,b) = &acfc^(/c) and (c,d) = front^{k) . Lf yk-i yf j/fc+i then we 
define LT^{k) = [d, a] and R-n{k) = [b,c] (see Figure 3). Lf yk-i = yk+i we define 
LTr{k) = RTr{k) = 0. Note that LT^{k) and R-nik) are also defined for k = 0 or 
k = p in the case when tt is a closed e—path. 




Fig. 3. A surfel yk of tt, Fig. 4. A surfel yk of tt, 

and the two sets LT,{k) and the two sets Le/t^(fc) Fig. 5. An example for 
and RTr{k). and Right^(k). which Right^ik) = 0. 
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Notation 3 For a surfel x and a given vertex number w € {0, 1, 2, 3} we denote 
by £w(x) the unique loop associated to the vertex w of x which contains the 
surfel X. 

Definition 8 Let tt = {yk)k=o,... ,p be an e—path. For 0 < k < p (0 < k < p if tt 
is closed) we define the set of surfels Left^{k) and Right^{k) by: 

LefUk) = [ U £w{yk)]\{yk-i,yk,yk+i} and 

w^Ltj- (k) 

Right^{k) = [ y Cwivk)] \ {yk-i,yk,yk+i}- 

wGRTv(k) 

Note that one of these two sets can be empty if {yk-i,yk,yk+i} constitute a 
loop with a length 3 (i.e. yk-i is e— adjacent to yk+i)- Such a case is depicted by 
Figure 5. In the case when yk-i = Vk+i, both sets Left^{k) and Right ^{k) will 
be empty since L^^^k) and RT^{k) have been defined as empty in this case. See 
Figure j for an example of such sets. 

Remark 3 Due to our assumption that loops are topological disks, we have: 

Left.^{k) n Right j^{k) = 0. 

If yk-i is not e-adjacent to yk+i then Ny{yk) \ {yk-i,yk+i} has exactly two 
Cyj, —connected components (see Definition 1): Right .^{k) and Left.^{k). 

In the sequel of this paper we have set some restrictions on the paths studied. 
In this goal, we define a property (P. 

Notation 4 Let c = (xj)i=o,... ,q be a v—path and tt = {yk)k=o,... ,p be an e—path 
in E. We say that the property (P(7r, c) is satisfied if when c is not closed then 
co,Cq ^ 7T* and when tt is not closed then yo,yp ^ c* . 

Definition 9 Given a v-path c = (xi)i=o,... ,q and an e-path tt = {yk)k=o,... ,p 
such that ‘J’(tt, c) holds, we introduce the contribution to the intersection number 
of a couple {k,i) with k G {1, ... ,p — 1} ('{O, . . . ,p} if tt is closed) and i € 
{1, 1} ('{0, • • ■ , 9 } if c is closed) by I^^dk, i) = J~^y{k, i) + I)f^y{k, i) 

where: 

= 1 if Xi = Vk, Xi-i e Right,,{k). if Xi = yk, Xi+i e Right„{k). 

ifxi = Vk, Xi-i e Left^{k). X+^(fc,i) = | if Xi = yk, Xi+x G Left^{k). 
2~„(k,i) = 0 in all other cases. Xf ,.(k,i) = 0 in all other cases. 

Note that I~y{k,i) = 0 (resp. I+^{k,i) = OJ if x^ ^ yk or yk-i = 2/fe+i or 
Xi-i G {yk-i,yk,yk-ki} (resp. Xi+i G {yk-i,yk,yk-ei})- If yk-i = yk+i, we have 
I~y{k,i) = Tif y{k,i) = 0 since Left^{k) = Right ^{k) = 0. 

In other words, J~y{k, i) depends on the position ofxi-i relative to the e-path 
TT at the surfel yk. And Fif y{k,i) depends on the position o/xj+i. 
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Fig. 6. A n— path c (light grey) and 

an e— path tt (in dark grey) such that Fig. 7. A w— path c and an e— path tt 
1-K,c = 0. such that = ±1- 

Definition 10 Let c = (a;i)j=o,... be a v—path and let tt = {yk)k=o,... ,p be an 
e-path satisfying the property ‘T{tt,c). The intersection number of the v—path c 
and the e—path tt, denoted by is defined by: 

p—lq—1 

'^'K,C ^ ^ ^7T,C ^ ^ ^ ^ ^7T,c(^5 ^ ^ ^ ^ ^7r,c(^7^- 

Figure 6 and Figure 7 show two examples of intersection numbers. 



Notation 5 We denote: 

i;.c(*) = and J^,,(fc) = |]X^,,(fc,z) 

k—Q 7=0 



3 Independence up to homotopy 

The purpose of this section is to state and sketch the proof of the two following 
theorems: 

Theorem 1 Let tt = {yk)k=o,... ,p be an e—path, c = (a;j)i=o,... ,9 ei^d c' = 
(x')i=o,... ,q' be two V— paths on a digital surface S such that T(7t, c) and ‘?(tt, c') 
hold. Lf c' is V— homotopic to c in S (in U \ {z/ 0 ) 2 /p} if is not closed), then 
7 — 7 , 

-^7T,C — -^7T,C' • 



Theorem 2 Let c = (xi)i=o,... ,g be a v—path of a digital surface S , tt = 
{yk)k=o,... ,p eind tt' = ( y ^) fc = o ,... ,p' be two e— paths such that (P( 7 r, c) and (P( 7 r', c) 
hold. Lf tt' and tt are e— homotopic in S (in i7\{a;o,Xg} if c is not closed), then 
7 =7 I 

-^7T,C -^7T ,C’ 
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We first prove Theorem 1, and Theorem 2 will appear as a corollary of Theorem 1. 

Since n— homotopy and T— deformation are equivalent relations, then we can 
prove the equality when considering the latter relation between the paths c 
and c' . Furthermore, to prove these properties for any £— deformation, it is 
sufficient to prove it for an elementary deformation. Now, in order to prove 
this equality, we consider the contribution of intersection intervals between the 
two paths 7T and c on a first hand, tt and d on the other hand. But these intervals 
can have very bad properties when considering the general case, especially when 
7T has local back an forths (see Definition 6). So, to avoid this special cases, we 
first suppose that the path tt has no local back and forth and prove Theorem 1 
in this case. In a second step, we prove that the intersection number between 
a u— path c and an e— path tt is left unchanged when one removes all back and 
forths in tt. 

First we state some technical lemmas. 

3.1 Important lemmas 

Let TT = {yk)k=o,... ,p be an e— path with no local back and forth on S, then we 
have the three following lemmas: 

Lemma 2 Let k G [l,p — 1] if tt is not closed and k G [0,p] otherwise. If 
xi and X 2 are two surfels of a given loop containing the surfel yu such that 
{xi, X 2 } n ?/fc, 1 /fc+i} = 0, then we have either {xi,X 2 } C Left.^{k) or 

{xi,X 2 } C Right .^{k). 

Lemma 3 Let k G [l,p — 2 ] if tt is not closed and k G [0,p — 1] otherwise. 
If X is a surfel such that x ^ {j/fc-i, 2/fc, J/fc+i, yfc+ 2 } and x, yk and yk+i are 
included in a common loop, then: x G Right^{k) (resp. Left.,^{k)) if and only if 
X G Right^{k + 1) (resp. Left^{k + 1) ). 

Lemma 4 Let k G [l,p — 3] if tt is not closed and k G [0,p — 2 ] otherwise. If 
k is such that yk, yk+i and yk +2 are included in a common loop, yk-i J/fc +2 
and yk+z ^ yk, then yk +2 G Right^{k) (resp. Left.„{k)) if and only if yk G 
Right.,^{k + 2 ) (resp. Left^{k + 2 )). 

3.2 Independence when tt has no local back and forth 

In order to prove the Theorem I, we use the following proposition: 

Proposition 5 Let c = {xi)i=Q,... ,q and d = (a;')j=o,... ,g' be two v— paths on a 
digital surface S, let tt = (yk)k=o,... ,p be an e—path with no local back and forth 
such that (P{tt,c) and (P(7r, c') hold. If d is an elementary £— deformation of c 
in S (in E \ {yo,yp} if tt is not closed) then 

In the sequel of this subsection, c and d are u— paths satisfying the hypothesis 
of Proposition 5. From the very definition of the elementary deformation, d 
is obtained by the removal in c of the surfel xi {0 < I < q if c is not closed. 
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and 0 < I < q otherwise). This means that c' = (xq, . . . , where x' = Xi if 
0 < i < I and x' = x^+i if I < i < q. 

q-l q-2 

We have to prove that J,r,c = with I,r,c = ^ ^ c/ (i) ■ 



i—0 i—0 

For i = 0, . . . , I — 2 if c is closed and for i = 1, . . . ,1 — 2 otherwise, we have 
Xi_i = x'_]^, Xi = xi and x^+i = x'^j^, so that ^,{i) = ^{i). If c is not closed 

then (0) = = 0 since xq = Xq ^ tt*. Similarly, for i = ? + 1, . . . ,q — 2 

we have x'_^ = Xi, xi = Xj+i and x'^^ = Xi+ 2 , so that + !)• 

Finally we have to evaluate the difference : 



I.,c - I.,d = [i:,M - 1 ) +KM) +Kc(i + 1 )] - [KAI - 1 ) +KAI)] 



( 1 ) 



In order to prove that this difference is zero, we sum the contributions of inter- 
section intervals between tt and the three surfels {xj-i, x;, x;+i} of c on a first 
hand, and between tt and the two surfels {xJ_j^,xJ} of d on the other hand. 

Definition 11 Let b = {zq,... , Zr) be an e-path, and d = {to,... ,ts) be a 
v—path. Let X be a set of integers in [0,s] (X will be either {I — 1,1,1 1} or 

{I — 1,1}). We define the set of intersection intervals between the path b and d 
in X by: 

Ab,d{X) = {[fci, fc 2 ]| Vfc e [ki,k 2 ], 3/i G X, Zk = th and Vi G X, U ^ {zk^-i, Zk^+i}} 
When applying this definition, we shall use b = tt and d = c or d = c' . 



In other words, an interval [ki , ^ 2 ] is a maximal sequence of subscripts of 
surfels of tt which belong to a certain subset of c*. The idea of the following is 
that the deletion of a surfel in c will either suppress, reduce or disconnect such 
intersection intervals but the sum of the contributions of the resulting intervals 
will be equal to the contribution of the initial interval. More precisely, we shall 
prove: 

Proposition 6 Denoting A = Vl,r,c({^ — 1,1,1 -\- 1}) and N = Vl,r,c'({^ — 
then VA G A: 

^ ' LT,^c{k, I — 1) + L.„^c{k, 1) + Tn,c{k, / -t- 1) = ^ ^ {k,l — 1) + {k,l) 

fceA \' ga ' fcgA' 

>'CA 

Sketch of proof: To prove this proposition, we first note that the intersection 
intervals in A must have a length I < 3 or the three surfels x;_i, x; and xi+\ 
must be pairwise e— adjacent as in the only two possible configurations depicted 
in Figure 8. The equality of Proposition 6 is shown by using Lemma 2 for intervals 
whith a length 1, Lemma 3 for intervals with a length 2 and Lemma 4 for intervals 
with a length 3. For intervals with a length greater then 3, the proof uses these 
three lemmas and considerations about periodicity of tt in this special case. □ 
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Fig. 8. Two cases when 3 surfels are pairwise e— adjacent. 

Sketch of proof of Proposition 5: 

Since {k G [0,p] | yu G {xi-i,xi,xi+i}} = u A and the A G ^ are pairwise 

AeA 

disjoint, and similarly for A' , due to equation (1) and Proposition 6, we have: 

T — T / — 

-^7T,C -^7T,C' 

^ I - 1) +F.,c(fc, Y. Y - 1) 0 

Aea fcsA Ae/i £a> kex' 

X' cx 

= 0 . □ 



3.3 Independence under Shrunk operation 

Given an e— path tt = {yk)k=o,... ,p, we define the operations shrunkiir) and 
Shrunk (it): 

Definition 12 Let tt = {yk)k=o,... ,p be an e—path such that that there exists I, 

0 < I < p (0 < I < p if TT is closed) such that yi-\ = yi+\ and for all I' such 
that yi'-i = yi'+i, then I < I' . We define shrunken) as the path obtained by the 
deletion in tt of the two surfels yi and yi+i- Remark that shrunk{Tr) is still an 
e—path since yi-i{= yi+\) is e— adjacent to yi+ 2 - 

Shrunk{'K) is the path obtained after sequential applications of shrunk on tt 
until the resulting path has no local back and forth. 

The following lemma means that removing all back and forths in the e—path 
leaves the intersection number unchanged. 

Lemma 7 Let tt = {yk)k=o,... ,p be an e—path on S and c = (xi)i=o,... ,q o v—path 
on S such that T(7t, c) holds then = '^Shrunk{ir),c- 

In order to prove this lemma, it is sufficient to prove that I,r,c = 1shrunk(-K),c- 
To to this, we first observe that if tt has its first local back and forth at yi 
{I G [0,p] if TT is closed, I G [l,p — 1] otherwise) then ^ — ^shrunk = 
- 1) + + 1) - ^shrunk( 7 r),c(^ ~ I)’ ^^611, W6 Can provc for 

1 0, . . . ^ Q 1 that ^7i-jC 1 1 ; ^ shrunk (tt) ,c(^ ^ 5 ^) 

Several cases must be examined depending on the position of surfels Xi, Xi-i 
and Xi+i relative to the surfels yi- 2 , yi-i = yi+i, yi and yi+ 2 - Each case is 
straightforward. 
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3.4 Proofs of the main results 

Proof of Theorem 1: By induction, it is sufficient to prove Theorem 1 in 
the case when d is an elementary T— deformation of c. If tt' = Shrunk (tt), then 
lemma 7 shows that X,r,c = ,c- Since tt' has no local back and forth, and since 

c' is an elementary £— deformation of d in S (in if \ {j/o, Vp} if tt is not closed), 
then from Proposition 5 we have : Now, Lemma 7 implies that 

^Shrunk{Tz),c' • Finally, T7 t,c -TtTjC^- FI 

Sketch of proof of Theorem 2: It is sufficient to prove Theorem 2 in the case 
when tt' is an elementary 7i— deformation of tt. Hence we assume tt = 7ri.a.7T2 
and tt' = 7Ti.a'.7r2 with a and a' being two e— paths with same extremities and 
being contained in a loop L. 

Then, we prove that c is r;— homotopic to a r;— path d which contains no 
surfel of the loop C so that From Theorem 1, we have = 

-^7T,C' — -^7T ,C' — -^7T,C' ' 

4 A new Jordan theorem 

Definition 13 A simple closed n— curve of surf els in S is an n— connected set 
C of surfels such that for any x G G, the surfel x has exactly two n— neighbors 
in G. 



Definition 14 For a simple closed n— curve G there exists a sequence c = 
(a;i)i=o,... ,9 with q = Card{G) such that for i,j G [0, g] we have Xi,Xj G G, 
and such that Xi is n— adjacent to Xj if and only if i = j + l[q] or i = j — l[g]. 
Such a closed path c is called a parametrization of 6. 

We shall use the following theorem: 

Theorem 3 (see [7]) A digital surface is an e— connected set of surfel. 

Using the notion of the intersection number, we prove: 

Theorem 4 Let (n,h) G {{e,v),{v,e)}. If it = (yk)k=o,... ,p is a parametrization 
of a simple closed n— curve of surfels on a digital surface U, not included in a 
loop, such that tt ttq = (y,y) (tt is n— homotopic to a path reduced to a single 
surfel y), then U \ tt* has exactly two n— connected components. 

In the sequel of this section, tt = {yk)k=o,... ,p satisfies the hypothesis of 
Theorem 4. Moreover, to improve the readability, we assume that n = e. The 
idea of the proof is the same in the case n = v. We need the two following 
lemmas : 

Lemma 8 For any surfel yu of tt there exists two surfels a and (3 such that 
a G Right ^{k) and (3 G Left^{k). 

Lemma 8 is a consequence of the fact that tt is not contained in a loop. 
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Lemma 9 There exists two surfels a and P in S\tt* which are v— adjacent to 
a surfel yk of it and which are not v— connected in E\tt* . 

Proof: Let a and P be the two surfels defined in Lemma 8. Now, we suppose 
by contraposition the existence of a w-path c = (a:i)i=o,... ,g in L'\7r* between the 
surfels a and P, v— neighbors of the surfel yk in tt and which does not intersect 
7T*. We denote c' = (xg = a, .. . ,Xg = P,yk,Q.) which is a closed ri— path. Then, 
from the definition of a and P, Ttt,c' = ±L Now, since tt is e— homotopic to a 
single surfel and from Theorem 2 we should have = 0. This contradicts the 
existence of tt. □ 

Lemma 10 Let yk and yk+i be two consecutive surfels of tt. For any surfel 
s TT* , which is v— adjacent to yk, there exists a surfel t ^ tt* , v— adjacent to 
yk+i, and a v—path from s to t in E\tt* . 

This lemma can be proved by local considerations. 

Proof of theorem 4: From lemma 9 there exists two surfels a and P which are 
f— adjacent to a surfel yk and not v— connected in E\tt* . In particular, E\tt* 
has at least two v— connected components. 

Furthermore, for any surfel x € E \ tt* , since E is e— connected, then there 
exists a w— path c' in 27 \ tt* from a; to a surfel which is u— adjacent to a surfel 
of TT*. 

Using inductively lemma 10, we see that we can prolong the v—path c' to a 
v—path in 27 \ 7T* from a: to a or from x to p. This implies that E\tt* has at 
most two V— connected components. □ 



Conclusion 

We have defined the intersection number between a v—path and an e— path lying 
on a digital surface, and we have proved that this number of “real” intersections 
between two surfels paths is invariant under homotopic deformations of the two 
paths. The intersection number is a new “topological invariant” in the context 
of digital surfaces. Thus, the intersection number has been used to easily prove 
a new Jordan theorem for surfels curves. It appears to be a useful tool for digital 
topology. In further work, we shall use the intersection number to study topology 
preservation in digital surfaces, following the work of [5]. 
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Abstract. A new method of representing a surface in the 3D space as a single 
digitally continuous sequence of faces is described. The method is based on 
topological properties of quasi-manifolds. It is realized as tracing the boundary of a 
growing set of labeled faces. As the result the surface is encoded as a single 
sequence of mutually adjacent faces. Each face is encoded by one byte. The code of 
the surface of a three-dimensional object takes much less memory space then the 
raster representation of the object. The object may be exactly reconstructed from 
the code. Surfaces of a genus greater that zero (e.g. that of a torus) may also be 
encoded by a single continuous sequence. The traversal algorithm recognizes the 
genus of the surface. 



1 Introduction 

We consider here the possibility to represent the boundary surface of an arbitrary 
connected subset of voxels in a 3D space as a single continuous sequence of faces. Such a 
sequence may be encoded by means of some kind of „generalized chain code“ in a rather 
economical way. The described method is also applicable for scanning a connected 
subset of a surface by stepping from one face to another which is adjacent with it, in such 
a way that the set of already scanned faces is „compact“, i.e. it is connected and its 
boundary is not too long. This is important for dissolving a surface into patches each of 
which is a subset of a digital plane similarly to dissolving curves into digital straight 
segments [Kov97]. The well-known methods of encoding surfaces are not applicable for 
this purpose: the method by Gordon and Udupa [GorUd89] dissolves the surface into 
closed loops isolated from each other; the method of representing the surface as an Euler 
circuit [RosKW91] is only applicable to a whole closed surface. 

We consider here the three-dimensional space not as Z? but rather as a Cartesian 
abstract cell complex (ACC) as defined in earlier publications of the author, e.g. in 
[Kov93]. To make the reading easier we have gathered the most important definitions in 
Appendix 1 (Definitions 1 to 12). An ACC has all traditional properties of a topological 
space, i.e. it possesses a system of open subsets (Definition 3 in the Appendix) which 
satisfies the classical topological axioms [Kov89]. Therefore we often speak of a space, 
while implying an ACC. 
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We consider the coordinate axes of the space as one-dimensional ACC’s (Definition 
1) similar to the Khalimsky line [KhalKM90]. We assign subsequent integer numbers to 
the mutually incident cells of the axes: 0-cells of the axes get even numbers, 1 -cells get 
odd numbers. These numbers are declared to be the topological coordinates of the cells. 
There are four different kinds of cells in the three-dimensional space: three-dimensional 
cells (3-cells) are the voxels, 2-cells are the pixels (the same as surfels ox faces), 1-cells 
are cracks (the same as linels or edges) and 0-cells are points (the same as pointels). We 
shall use the term „pixel“ (as in [RosKW91]) instead of „face“ in order to retain the same 
term for a 2-cell both in a 3D and in a 2D space. 

A cell of the space is considered as a Cartesian product of three cells of each of the 
axes: a voxel is a product (i.e. a triple) of a 1-cell of the A-axis, a 1-cell of the T-axis and 
a 1-cell of the Z-axis; a pixel is a product of two 1 -cells and one 0-cell etc. The bounding 
relation in the Cartesian complex is defined as follows: a cell with topological 
coordinates (A„ T„ Z,) bounds a distinct cell (X^, Y^, Z^) iff all three absolute differences 
|Aj-AJ, |Tj-yj and jZ^-ZJ are less or equal to 1, the dimension of the cell A, in the 
complex of the A-axis is not greater than that of A^ and the same condition is fulfilled for 
the other two coordinates. 



2 Boundaries of Sets of Voxels 

We consider here surfaces which are boundaries of sets of voxels. We use the traditional 
topological definition of the boundary (frontier, Definition 4) by means of open sets, the 
latter being defined by means of the bounding relation of the cell complex (Definition 3). 

Our aim is to detect and to encode surfaces, i. e. connected components of boundaries 
of objects in the 3D space. Topological properties of boundaries are of great importance 
for our method. We have found, that the boundary of a connected set of voxels may have 
a rather complicated topological structure even in the case of strongly connected sets of 
voxels (Definition 8): boundaries are not always two-dimensional manifolds (2- 
manifolds. Definition 10). Thus e. g. a crack incident with two voxels of a set V, which 
have no common pixel, is incident with four pixels of the boundary of V, which is 
prohibited in a 2-manifold. It is easy to find examples of strongly connected sets, in 
which a boundary point is incident with two umbrellas (Definition 12) which is also 
prohibited in a 2-manifold. 

An umbrella in the boundary of a strongly connected set of voxels may have a self- 
intersection along a crack as demonstrated in Fig. 1. The umbrella about the point P 
consists of six cracks and seven pixels: four of them are shown as shaded areas, the 
remaining three are not visible; they are shown by dotted lines. The pixels f „ F^, and 
F^ (the latter two are not indexed in Fig. 1) are all incident with the crack C^. 
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It is easily seen that imposing additional restrictions, such as e.g. locally strong or 
regular connectedness, onto the voxel sets under consideration does not essentially 
change the situation: boundaries of voxel sets encountered in applications do rarely 
possess the properties of manifolds. 




Fig. 1. Example of an umbrella with a self-intersection along a crack 

However, the boundaries still have a rigorous topological structure which may be used 
to organize the tracing of the boundary. We suggest to denote this structure as a two- 
dimensional quasi-manifold. 

Definition Ql: A one-dimensional quasi-manifold is a connected one-dimensional 
ACC in which every 1-cell is bounded by exactly two 0-cells and every 0-cell bounds an 
even number, at least two, of the 1 -cells. 

Definition Q2: A two-dimensional quasi-manifold is a connected two-dimensional 
ACC in which every 2-cell is bounded by 0- and 1 -cells composing a one-dimensional 
manifold, i.e. a cycle; every 1-cell is bounded by exactly two 0-cells and bounds an even 
number, at least two, of the 2-cells. The 1- and 2-cells bounded by a 0-cell compose one 
or more subcomplexes each of which is B-isomorphic (Definition 9) to a one- 
dimensional quasi-manifold. 

An example of a one-dimensional quasi-manifold is shown in Fig. 2. As an example of 
a two-dimensional quasi-manifold (2-quasi-manifold) consider the boundary (i. e. the 
surface) of the voxel set shown in Fig. 1. 

A notion important for handling n-quasi-manifolds is that of adjacency of «-cells as 
suggested in [RosKW91]. The adjacency defined there depends upon the voxel 
connectedness: the given set of voxels may be declared either as a 6- or 18-connected 
one. There is no such notion in the topology of cell complexes; the membership of 
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cells of lower dimensions in the set under consideration must be specified instead: e.g. if 
the crack common to two voxels having no common pixel belongs to the same set as the 
voxels then they are connected, otherwise they are not. 




Fig. 2. An example of a one-dimensional quasi-manifold: 
the point P bounds 8 1 -cells; each of the other points bounds two 1 -cells 

The adjacency of the w-cells of an w-quasi-manifold may be defined here by locally 
dissolving the quasi-manifold into manifolds. “Locally” means that only subsets of the 
quasi-manifold and of the desired manifolds in the neighborhood of the cell which 
bounds all the u-cells under consideration are considered. These subsets intersect neither 
each other nor the connected set of voxels. This topological foundation and 
generalization of the adjacency of u-cells in an w-quasi-manifold will be published in a 
separate paper. The adjacency used here is equivalent to that defined in [RosKW91] for 
6-coimected objects and an 18-connected background. 

In the simplest case of a Cartesian 3D space the difference between a 2-manifold and 
a 2-quasi-manifold consists in the following: 

1) A crack of a 2-quasi-manifold may bound four pixels, while a crack of a 2-manifold 
may bound only two. These four pixels compose two pairs of adjacent pixels. A path 
through the pixels of a 2-quasi-manifold, when traversing such a crack, must go from 
one of the adjacent pixels to the other. 

2) A point of a 2-quasi-manifold may belong to as many as four umbrellas (Definition 

12), while a point of a 2-manifold may belong to only one umbrella. If there are more 
than one umbrella incident with a point P, then they may be either separate (except 
the common point P) or two umbrellas may share a crack. In the latter case pairs of 
adjacent pixels must be specified to define the order of traversing the pixels in the 
neighborhood SON(T’, E) of a point P in the 2-quasi-manifold B. (The SON of a point 
is its smallest open neighborhood according to Definition 3a in the Appendix). 

Let us firstly consider sets of voxels which are strongly connected (Definitions 6 to 8) 
and do not touch the border of the 3D image. Another important requirement is that both 
the set and its complement are homogeneously tree-dimensional complexes (Definition 
11). We shall call such sets solid ones. This property is important when 
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regarding a set of voxels as an ACC, since the concept of an ACC allows that two voxels 
are in the set under consideration, however their common side (e.g. the common pixel or 
a common crack) is not. In such a case this side belongs to the boundary and makes the 
structure of the boundary rather complicated, not suitable for our purpose. 

Proposition 1: The boundary Z? of a simply and strongly connected solid subset V of 
the Cartesian 3D space (ACC) is a two-dimensional quasi-manifold. 

The proof may be found in Appendix 2. 



3 The Ideas of the Traversal Algorithm 

Our algorithm is based on the properties of a 2-quasi-manifold. Consider a surface which 
is the boundary B of a given set V of voxels. If V satisfies the conditions of Proposition I 
then B is a 2-quasi-manifold. Let us label the closure of a single pixel of B and consider it 
as the seed of the set L of labeled cells of B. The boundary of L relative to B is a 
1 -manifold, i.e. a closed sequence of alternating points and cracks. It may be traced by a 
sequence of steps from one point to the next one until the starting point is reached again. 
During this tracing the closures of pixels incident with the cracks of the boundary of L 
are joined with L and labeled as those belonging to L. In this way L becomes greater 
while remaining strongly connected. We shall show below, that if only pixels simple with 
respect to L are joined with L, then L remains simply connected and its boundary remains 
a 1-manifod. The boundary can be traced in the same manner as before. During the 
tracing of the current boundary of L the relative positions of joined pixels are stored as 
some kind of a three-dimensional chain code. This procedure must be repeated as long as 
there are unlabeled simple pixels incident with the cracks of the boundary of L. If the 
next pixel encountered during the tracing is a non simple one then it is not joined with L, 
i.e. it is not labeled. However its position is nevertheless included into the code to make 
the sequence of the encoded pixels continuous. At the beginning of the process the 
trajectory of the tracing is evolving like a spiral. The sequence of grasped pixels looks 
like the peeling of a potato. 

The tracing stops, when there are no more simple pixels incident with the boundary of 
L. At this stage all pixels of B are grasped in the code (some of them more then once). 
This code may be used as the code of the surface B. The code specifies the surface B and 
therefore also the set V uniquely: the set V may be exactly reconstructed from the code. 

Let us consider the theoretical background of these ideas. 

Definition S: When given a 2-quasi-manifold B and a subset L of B, a pixel F of the 
set B-L is called simple with respect to L if the intersection of its boundary Fr(F,B) with 
the boundary Fr(L,B) of L is connected. 
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We shall show next that if a simple pixel is united with L, then the number of 
components both of L and of B-L remains unchanged. Thus when only simple pixels are 
labeled, the spiral tracing runs continuously along the boundary of L until all simple 
pixels are labeled. After that the non simple pixels at the end of the sequence are labeled. 
This method (with some improvements) works well on all possible kinds of surfaces. 

Proposition 2: Let L be a subset of a 2-quasi-manifold B while the boundary Fr(L,Z?) 
of L relative to B is a 1 -manifold. Then the boundary Fr(LuCl(F),^) of the union 
LuCl(f) of L with the closure C1(F) of a pixel F of B-L having a common crack with L 
and being simple with respect to L is either still a 1 -manifold or it is empty. In the latter 
case L becomes identical with B. 

The proof may be found in Appendix 2. 

To describe the algorithm we need the following notions. 

The orientation of a movement along the boundary Vr{F,B) of a pixel F is called 
clockwise if the following three vectors compose a right system, similar to that of the unit 
coordinate vectors of the coordinate axes X, Y and Z: 

- vector 1 is pointing from B to a boundary cell; 

- vector 2 is that of the movement from that cell to the next one; 

- vector 3 is the inner normal to F, pointing to the inside of the object’s voxel. 

Such a movement is seen as a clockwise one around the center point of F when looking 
from outside of the object along the inner normal. 

Labeling a pixel F means setting a memory variable corresponding to this pixel to one. 
Simultaneously the variables corresponding to all cells of the boundary Fr(F,B) are set to 
one. At the start of the algorithm all variables are initialized by zero. In the computer 
realization each variable is a bit in the raster representation of the given 3D image 
containing one byte per voxel. 

As explained in Section 1, we use here the Khalimsky-coordinates of the cells. 
Therefore it is easy to calculate the movement from one cell to another which is incident 
with it: the movement is specified by the vector whose components are the differences of 
the corresponding coordinates. It is possible to interpret the coordinates of cells of 
dimensions greater then zero as those of their middle points, which makes the 
representation descriptive and comprehensible. 

To encode the movements from one cell to another during the tracing we use the 
notation represented in Fig. 3. The numbers from 0 to 5 denote the movements from one 
cell to another incident with it since these movements are always parallel to one of the 
coordinate axes. 

Hypothesis: Let In be the set of all pixels of B—L, which are incident with Fr(L,B). If 
In contains no simple pixels then it is identical with B-L. If it is empty, then the genus of 
B is zero. 

The hypothesis was confirmed by numerous computer experiments with surfaces of 
genus from 0 to 9. 

We denote the code of the set In which contains no simple pixels as the rest sequence. 
It contains all codes obtained during a full tracing of the boundary Fr(L,B) after the last 
simple pixel was found. 
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The Algorithm: 

1. Take any pixel of B as the starting pixel label its closure and save its coordinates as 
the starting coordinates of the code. This is the seed of L. Denote any one crack of the 
boundary of Fr(F„,B ) as and find the pixel F of B which is incident with and 
adjacent with F„. Set equal to F,, and the logical variable REST to FALSE. REST 
indicates that the tracing of the rest sequence is running. 

2. (Start of the main loop:) Find the crack as the first unlabeled crack of Fr(F,B) 
encountered during the scanning of Fr(F,B) clockwise while starting with the end 
point of which is in Fr(L,B). If there is no such crack and F is labeled stop the 
Algorithm: the encoding ofB is finished. 

3. If F is simple label its closure. 

4. Put the direction of the movement from F^^„ to and that of the movement from 

to F into the next byte of the code. If the pixel F is non simple set the corresponding 
bit in the code (to recognize codes of non simple pixels in the ultimate sequence). 




Fig. 3. Encoding of the directions of the coordinate axes and cracks 



5. If REST is TRUE check, whether F is equal to and is equal to C ^. (These 
variables may be defined in item 6 of the previous loop ). If it is the case stop the 
Algorithm and analyze the rest sequence to specify the genus of B as explained below. 
Delete multiple occurrences of pixels from the rest sequence. 

6. If F is simple set REST equal to FALSE. If F is non simple set equal to F, 

equal to and REST equal to TRUE. 

7. Set F equal to F. Find the pixel F^^^ of B incident with and adjacent to F. Set F 
equal to F^^^ and equal to Go to item 2. 

End of the Algorithm. 

As it may be seen from the Algorithm, after the last simple pixel has been found, the 
tracing of Fr(F,B) is continued and the non simple pixel are recorded (in the rest 
sequence) until the path along the boundary Fr(L,B) becomes closed. The genus of the 
surface is computed by means of the Euler number of the rest sequence, which in turn 
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may be deduced from the number of non simple pixels which were visited three or four 
times. 

Proposition 3: The genus G of B may be deduced from the properties of the rest 
sequence. It is equal to: 

G=(1+A^3/2+A^,)/2; (1) 

where M, is the number of pixels which occur in the rest sequence three times and is 
the number of pixels which occur in the rest sequence four times. The proof may be 
found in Appendix 2. 

The run time of the Algorithm is obviously proportional to the length of the code, 
since the time necessary to produce a code element is constant. The length of the code 
may be essentially greater then the number of pixels in B since the non simple pixels may 
arrive in the code many times. The worst case take place for sets of voxels looking like a 
dumbbell: two cylinders of a constant diameter connected at their flat ends by a long thin 
rod. After the pixels in the surface of the rod are labeled except of a sequence one pixel 
wide, the tracking may run along the rod back and forth as many times as the number of 
pixels in the height of the cylinders. Thus the worst case complexity is 0(BP), N being the 
number of pixels in B. 

There is a possibility, to make the complexity linear in N. This may be reached if the 
Algorithm changes the tracing mode from the clockwise to the counterclockwise one 
each time when a non simple pixel occurs. This version of the Algorithm is described 
below. 



4 The Reversible Tracing 

The only drawback of the method of Section 3 is that the part of the boundary of L which 
is incident with non simple pixels must be sometimes traced again and again in order to 
reach all simple pixels. This is due to the fact that in spite of the connectedness of B-L 
the subset of simple pixels in B-L may be disconnected. Thus the sequence may contain 
many non labeled pixels and the sequence becomes unnecessary long. 

This difficulty may be overcome by means of reversing the direction of the tracing 
each time when a non simple pixel is encountered. This means that the tracing is being 
continued in such a direction that the labeled pixels lie no more to the left hand side of 
the direction (left tracing) but rather to the right hand side (right tracing). 

Fig. 4a shows an example of the left tracing: the pixels labeled before are shown as a 
shaded region. They lie to the left hand side of the direction of the tracing. The pixel „N“ 
is a non simple one: its boundary crosses the boundaries of the pixels which were labeled 
before at two disconnected locations, namely at the pixels 3 and 14. Fig. 4b shows how 
the tracing has been continued as the right tracing starting with pixel 14 and running 
through the pixels 15 to 22. 
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When realizing the reversible tracing, it is more expedient to trace not the usual 
boundary of the labeled region but rather the so-called „open boundary“ (Definition 4a, 
[Kov92]) of this region. Sequences of traced pixels as shown in Fig. 4a and Fig. 4b are 
examples of subsets of the open boundary of the labeled region. The reason of changing 
to open boundaries is that the sequence of cracks with incident pixels does not contain the 
pixels incident with the „comers“, such as e.g. the pixel 3 in Fig. 4a. This was admissible 
for the algorithm of Section 3 : a missing pixel was registered during the next turn of the 
spiral. However it is not admissible for the reversible tracing, as we shall see in what 
follows. 
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Fig. 4. Examples of reversible tracing: a) left tracing trough the pixels 1 to 14; 
b) continued as right tracing through pixels 15 to 22; c) simultaneous left and right tracing 



In the most cases, when a non simple pixel is met in the open boundary, the tracing 
may be continued in the reversed direction, as e.g. in Fig. 4a, continued in Fig. 4b. 
However there are cases, when directly after such a reversing no simple pixels may be 
found (Fig. 4c). Then the tracing must be continued simultaneously in both directions. 
Two sequences of pixels are then recorded: one for the left tracing and one for the right 
one. As soon as in one of these sequences a simple pixel is encountered, the other 
sequence is discarded. In this way the number of non simple pixels in the resulting 
sequence is minimized. 

In the example of Fig. 4c the left tracing goes on until the non simple pixel 5 is 
reached. Since that location two sequences are recorded: 5, 6, 7, 8, etc. for the left tracing 
and 5, 9, 10, 1 1 for the right tracing. At the location 12 a simple pixel is found. Thus the 
codes for the pixels from 6, 7, 8 etc. are discarded. The sequence 5, 9, 10, 11 is 
transferred into the ultimate sequence and the tracing is continued from pixel 12. 

The question may arise: when will be the pixels 6, 7, 8 etc. picked up again and 
included into the code? According to Proposition 2, the boundary of L remains always a 
1 -manifold, i.e. a closed sequence of points and cracks without branching. Therefore one 
may be sure that at some time later on, the boundaries of these pixels will be reached by 
the tracing procedure. 
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To make this assertion illustrative we can continue the image of Fig. 4c in two 
different ways. If B is of genus 0, then the situation may look as shown in Fig. 5: pixels 
from 12 to 18 will be labeled since they are simple. After the pixel 18 has been labeled, 
the pixel X becomes simple an so do the pixels 11, 10, etc. until Y. 
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Fig. 5. A possible extension of the fragment of Fig. 4c; explanation in text 

The ultimate sequence may contain (besides the rest sequence) some non simple pixels 
which are necessary to encode the path to further simple pixels. However the non simple 
pixels remain unlabeled. If the surface B is of genus 0 then the set of not labeled pixels 
incident with the current boundary of L becomes smaller and smaller until it is empty. 
Then the encoding of such a surface is finished. 

The other possible way to extend the image of Fig. 4c is to considered it as a fragment 
of a surface of genus greater then 0. In this case the boundaries of the pixels 6, 7, 8 etc. 
will belong to the rest sequence as explained above and illustrated by Fig. 6. The pixels 
will be included into the code as non simple ones. 





Fig. 6. An example of the pixels of the rest sequence: 
a) the simple pixels of a torus are shaded, the non simple ones are left white; 
b) the set of non simple pixels of the same torus without the voxels 
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In the case of a surface of a genus greater than 0 (e.g. a toms) a closed sequence of 
non simple pixels arises at the end of the tracing procedure: the left and the right tracings 
are continued without finding a simple pixel until they meet each other while having 
opposite directions. The algorithm must be organized in such a way that the sequences 
contain all non simple pixels: if some pixels at the starting point of the simultaneous 
tracing are missing, then they are lost for ever. This is the reason of using the open 
boundary rather then the closed one: the open boundary is a continuous sequence of 
adjacent pixels. 

As in the method of Section 3, the non simple pixels contained in the rest sequence are 
labeled with „bmte force“. The record of a pixel in the rest sequence is changed as for a 
simple pixel when the pixel arrives in the rest sequence for the first time. After that some 
records of non simple pixels still remain at the end of the sequence. These are the 
multiple records of pixels which were visited more than once. These records are used for 
the calculation of the genus and then deleted. 



5 Computer Experiments 

The method of Section 4 was successfully tested on many different 3D images. The first 
experiments were made with simple artificial volumes as a hemisphere or a hemisphere 
with a narrow vertically stretched parallelepiped attached at the upper side of the 
hemisphere like a chimney. The surface of the latter object has a folder in the umbrella 
where a vertical crack of the hemisphere touches a vertical crack of the chimney. More 
recent experiments were conducted with tori of different size and with objects having up 
to 9 tunnels, whose surfaces are of genus from 2 to 9. In all cases the surface was 
scanned and encoded completely. The 3D objects were exactly reconstructed from the 
code. 



5.1 Efficiency of encoding 

The pixels of the sequence are encoded by 7 bits each as explained below. Thus the code 
contains one byte per pixel. The encoding may be made more economical by the factor of 
2 by means of the difference crack-code as suggested in [Kong92]. We have not realized 
this improvement in order to make the encoding and decoding as simple as possible. 

The efficiency of encoding, measured as the ratio of the number of the pixels in the 
surface to the number of elements in the ultimate sequence (plus one), is essentially 
better as that of the method of Section 3. It depends on the genus of the surface, as shown 
in Table 1 for the “worst case”: small objects whose walls are only one voxel thick. 

For bigger objects the efficiency is much better, as shown by the upper curve in Fig. 7. 
Thus, for example, the code of an parallelepiped of 38x38x19 voxels with 9 tunnels 
parallel to the Z-axis is 8952 bytes long. The surface contains 7704 pixels. The efficiency 
is equal to 0.861. 
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Table 1. Efficiency of the Code as a Function of the Number of Tunnels 



Number of tunnels 


0 


1 


2 


3 


4 


5 


6 


Method of Sect. 3 


1.0 


0.485 


0.336 


0.231 


0.179 


0.147 


0.125 


Method of Sect. 4 


1.0 


0.842 


0.685 


0.624 


0.619 


0.608 


0.557 




0 5 Number of tunnels 

Fig. 7. Efficiency of encoding small objects with the method of Section 4 (middle curve) 
as compared to that of Section 3 (lowest curve); 
the upper curve represents objects of 38x38x19 voxels 

These results were compared with that of encoding a surface as an Euler circuit as 
suggested in [RosKW91]. In all our experiments the efficiency of the Euler circuit was 
about 0.67. It should be mentioned here, that the method of Euler circuit is only 
applicable to a whole closed surface, while our method may also be used to encode any 
connected part of the surface, which possibility is important e.g. for dissolving the 
surface into digital plane patches. 



5.2 Examples of Codes. 

The codes of the pixels are represented as octal numbers of three digits. The first digit 
shows, whether the pixel is simple (digit=0) or not (digit=l). The second digit is the 
direction of the movement from the a pixel to the following crack; the third digit is the 
direction of the movement from the crack to the next pixel. The directions are encoded as 
shown in Fig. 3. 
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1. A bar consisting of three voxels along the Y axis. 14 pixels, 13 code elements. 
Coordinates of the first pixel =(3, 3, 4); 

The code: 

035, 040, 001, 01 1, 023, 035, Oil, 020, 005, 013, 054, 044, 044; 



2. A torus of 8 voxels; the axis of the tunnel is parallel to the Z axis. 32 pixels, 37 code 
elements. 

Coordinates of the first pixel =(3, 3,4); 

The code: 

035,040,000,021,015, 031,023,035,011,020, 000,045,051,000,144, 
002,011,113,124,144, 144,005,053,011,032, 020,011,015,033,033, 

054, 044, 044, 000, 100, 042, 021; 

3. A B-shaped body with two tunnels parallel to the Z axis; 50 pixels, 72 code elements: 

Coordinates of the first pixel =(3, 3, 4); 

The code: 



035, 040, 000, 021, 015, 
002, 010, 051, 012, 000, 
011,023,033,033,044, 
033, 033, 054, 044, 044, 
111 , 111 , 112 , 100 , 100 , 



031,023,035,011,020, 
054, 044, 032, 020, 044, 
044, 000, 015, 054, 033, 
000, 100, 042, 121, 111, 
124, 100, 100, 105, 144, 



000, 045, 051,000, 144, 
133,045, 000, 001,011, 
011,032, 120, 111,015, 
135, 150, 144, 133, 133, 
144, 053, 



The length of the code is always of the order of the number of the pixels in the 
surface: in the case of genus zero the number of bytes in the code is equal or a little 
greater then the number of pixels depending on the complexity of the shape of the 
surface. For surfaces with genus greater then zero the number of bytes is slightly greater 
than the number of pixels, depending on the genus and on the size of the surface 
(compare Fig. 7). 



Conclusion 

A new method of representing a surface in the 3D space as a single digitally continuous 
sequence of pixels is suggested. According to the method the surface is encoded as a 
single sequence of mutually adjacent pixels. Each pixel is encoded by one byte. The code 
of the surface of a three-dimensional object takes much less memory space then the raster 
representation of the object. The object may be exactly reconstructed from the code. 
Surfaces of a genus greater that zero (e.g. that of a torus) may also be encoded by a single 
continuous sequence. The algorithm recognizes the genus of the surface. The method is 
well suited for dissolving a given surface into patches of digital planes, which is its 
advantage over known methods. 
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Appendix 1: Topology of Cell Complexes 

In this section we shall remind the reader some topological notions necessary to read the 
presentation. In this way we hope to make the paper self-contained. 

Definition 1: An abstract cell complex C=(E, B, dim) (ACC) is a set E of abstract 
elements provided with an antisymmetric, irreflexive, and transitive binary relation 
B E X E called the bounding relation, and with a dimension function dim: E ^ I from 
E into the set / of non-negative integers such that dim(e’)< dim{e") for all pairs {e\e")&B. 

Elements of E are called abstract cells. It is important to stress that abstract cells 
should not be regarded as point sets in a Euclidean space. That is why ACC’s and their 
cells are called abstract. If a cell e' bounds another cell e " then e' is called a side of e 
The sides of an abstract cell e" are not parts of e"\ the intersection of two distinct abstract 
cells, differently from that of Euclidean cells, is always empty. 
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The maximum dimension of the cells of an ACC is called its dimension. We shall 
consider ACC’s of dimensions 2 and 3. Their cells with dimension 0 (0-cells) are called 
points, cells of dimension 1 (1 -cells) are called cracks (edges), cells of dimension 2 (2- 
cells) are called pixels and that of dimension 3 are the voxels. 

Definition 2: A subcomplex S = (E’, B\ dim’) of a given ACC C = (E, B, dim) is an 
ACC whose set E’ is a subset of E and the relation Z? ’ is an intersection of B with E’x E’. 
The dimension dim’ is equal to dim for all cells of E’. 

Definition 3: A subset OS of cells of a subcomplex S of an ACC C is called open in S 
if it contains all cells of S bounded by some cells of OS. 

We need this Definition to consider subsets which are open in the surface but not in 
the three-dimensional space. Note that in the topology of cell complexes there exists the 
notion of the smallest open neighborhood of a cell. This is the smallest open subset 
containing the cell. 

Definition 3a: The smallest subset of a set S which contains a given cell c of 5 and is 
open in S is called the smallest open neighborhood of c relative to S and is denoted by 
SON(c, S). 

Definition 4: The boundary Fr(5', C) of a subcomplex S of an ACC C relative to C is 
the subcomplex of C containing all cells c of C whose SON(c, C) contains both cells of S 
as well as cells of the complement C-S. 

Definition 4a: The open boundary Ob(5, C) of a subcomplex S of an ACC C relative 
to C is the subcomplex of C containing all cells c of C whose closure c u Fr(c, C) 
contains both cells of S as well as cells of the complement C-S. 

Definition 5: Two cells e’ and e" of an ACC C are called incident with each other in 
C iff either e’=e", or e’ bounds e", or e" bounds e’. 

Definition 6: Two cells e’ and e" of an ACC C are called connected to each other in 
C iff either e’is incident with e", or there exists in C a cell c which is connected to both e’ 
and e ". According to this recursive definition the relation of connectedness is a transitive 
closure of the incidence relation. 

It may be easily shown that the connectedness relation according to Definition 6 is an 
equivalence relation (reflexive, symmetric and transitive). Thus it defines a partition of 
an ACC C into equivalence classes called the components of C. 

Definition 7: An ACC C consisting of a single component is called connected. 
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Definition 8: A path in an ACC C of the form 

AqAi 

where x" is an n-dimensional and x" ^ is an (n-l)-dimensional cell of C, is called 
an n-dimensional path in C. An n-dimensional ACC C is called strongly connected if any 
two n-dimensional cells of C may be connected by an n- dimensional path in C. 

Definition 9: Two ACC’s are called B-isomorphic to each other if there exists a one- 
to-one correspondence between their cells which retains the bounding relation. 

Definition 10: An n-dimensional finite manifold is an n-dimensional ACC 
satisfying the following conditions: 

a) a 0-dimensional manifold M consists of two cells with no bounding relation 

0 

between them; 

b) an n-dimensional manifold with n > 0 is connected; 

c) for any cell c of the subcomplex of all cells different from c and incident with c 
is B-isomorphic to an (n-l)-dimensional manifold. 

Definition 11: An n-dimensional ACC C is called homogeneously n-dimensional if 
every A:-dimensional cell of C with k<n is incident with an n-cell of C. 

Definition 12: A subcomplex of a two-dimensional ACC C, consisting of all cells of 
C incident with a 0-cell B of C is called an umbrella of P. 



Appendix 2: Proofs of Propositions 

Proof of Proposition 1: To prove the proposition we must demonstrate that: 

1) The boundary of R is connected. This follows from the simply-connectedness of R. 

2) The boundary cells incident with a boundary pixel compose a one-dimensional 
manifold (1-manifold); 

3) There are two or four boundary pixels incident with each boundary crack. The pairs of 

adjacent pixels may be specified depending on the membership [Kov93] of the crack 
in the set V as explained below. 

4) The boundary cells incident with a boundary point compose one or more 
subcomplexes (umbrellas) each of which is B-isomorphic to a 1 -quasi-manifold. 

Consider first a boundary pixel F. In a Cartesian 3D space it is bounded by four cracks 
and their four end points. These eight cells compose a 1 -manifold. It is necessary to 
demonstrate, that all of them belong to the boundary of V. According to Definition 3, 
their SON’s contain the SON of the pixel B as a subset. Since V is solid, it is impossible 
that the voxels bounded by F belong both to V or both to its complement. 
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Thus the SON of F (consisting of F and two voxels) contains a voxel of V and a voxel of 
the complement of V. So do the SON’s of all the eight cells mentioned above. Thus they 
all belong to the boundary B and compose a 1 -manifold. 

A boundary crack C is incident with four voxels. They have four pixels incident with 
C. If only one of the four voxels is in V or the voxels being in V compose a strongly 
connected path, then exactly two of the four pixels are boundary pixels. They must be 
considered as adjacent. In the case when two diagonally adjacent voxels belong to V and 
the two other to the complement of V, all four pixels are boundary pixels. If the crack is 
considered as belonging to V, then two pixels bounding one and the same voxel of the 
complement must be considered as adjacent. Otherwise adjacent are pixels bounding one 
and the same voxel of V. 

The most complicated situation we have with a boundary point P. In the Cartesian 3D 
space it is incident with 6 cracks and 12 pixels. Some of them belong to the boundary of 
V. Their number depends upon the location of the voxels of V incident with P. Let us take 
a boundary crack C, and a boundary pixel F, both incident with P. There exists exactly 
one crack different from Cj which is incident with F^ and P. Thus each pixel in the 
neighborhood SON(T’, B) in the boundary B is incident with exactly two distinct cracks. 
According to the analysis of the preceding paragraph a boundary crack is incident with 
two or four boundary pixels. The cracks incident with P bound the pixels incident with P. 
This is the same situation as in a 1 -quasi-manifold, where any crack is incident with two 
points and any point is incident with an even number of cracks. Thus the subcomplex 
consisting of cracks and pixels in the SON(T’, B) is 5-isomorphic to a 1 -quasi-manifold. 

Lemma 1: If a I-manifold is a subset of another I-manifold then the manifolds are 
identical. 

Proof: Suppose the contrary. Then there must be in a pair of adjacent cracks the 
first one C, being in and the second Q not in M^. The common end point P of C, and 
Cj is both in and in M^. Let be the second of two cracks of incident with P. 
According to the supposition, is not identical with Q. Then C, is not in M^, since 
according to the definition of a 1-masnifold the are exactly two cracks incident with P in 
Mj. A contradiction. 



Proof of Proposition 2: Let L be a subset of a 2-quasi-manifold B while the boundary 
Fr(L,5) of L relative to 5 is a 1 -manifold. Let us consider the union of L with the closure 
C1(F) of a pixel F of B-L having a common crack with L and being simple with respect 
to L. The simplicity of F means that the intersection I of the boundary Fr(F,5) with L is 
connected. The intersection I contains at least one crack, namely the common one. The 
common cracks contained in I are no more boundary cracks of the union LuCl(F)- Only 
the cracks of the boundary Fr(F,5) which do not belong to the intersection belong to the 
boundary of the union. If there are no such cracks then the boundary Vr{F,B) lies 
completely in Fr(L,B). Both boundaries are 1 -manifolds. According to Lemma 1 they are 
identical. Thus the boundary Fr(LuCl(F),fi) becomes empty. 

If there are cracks in Fr(f ,5) not belonging to I then they replace the cracks of I in the 
new boundary of L. Since F is simple, this new part of the boundary of L is also 
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connected and has no common cells with the old boundary of L besides two points of I. 
Thus the new boundary is a 1 -manifold. 

Proof of Proposition 3: To specify the genus G of B we first calculate its Euler number. 
Z? is a union of L and B-L. The Euler number of L is equal to 1 since L was developed 
from a single pixel by means of an operation which does not change the Euler number. 
According to Hypothesis of Section 3, the set B-L consists of pixels incident with 
Fr(L,5) and cracks lying between these pixels. It is a stripe one pixel wide. It may be 
considered as a union of loops, like those shown in Fig. 6. The Euler number of a loop is 
zero since it consists of an equal number of pixels and cracks. If two loops intersect 
(compare Fig. 6) then they have P pixels and P-1 cracks in common (the cracks of the 
boundary Fr(L,P) do not belong to the loops). The count P-(P-1)=1 of these cells must 
be subtracted form the sum of the Euler numbers of the loop being zero. Thus each 
location containing an intersection of two loops reduces the Euler number by 1 . On the 
other hand, at each intersection there is either one pixel with 4 adjacent pixels in the 
loops or two pixels with 3 adjacent pixels. Thus the number or half the number N^, i.e. 
NJ2, must be subtracted from the Euler number of the loops. Therefore the Euler number 
£■ of P is equal to E=\-NJ2-N^ and the genus G of P is equal to G={2— 
E)I2^{\+NJ2+N^I2. 
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Abstract. As its analogue in the continuous framework, the digital fun- 
damental group represents a major information on the topology of dis- 
crete objects. However, the fundamental group is an abstract informa- 
tion and cannot directly be encoded in a computer using its definition. 
A classical mathematical way to encode a discrete group is to find a pre- 
sentation of this group. In this paper, we define a presentation for the 
fundamental group of any subset of a digital surface. This presentation 
can be computed by an efficient algorithm. 



Introduction 

As its analogue in the continuous framework, the digital fundamental group, 
originally introduced by T. Y. Kong in [4] in the 3D case (see also [5]), represents 
a major information on the topology of discrete objects. It is in particular related 
to the notion of a simple point in 3D ([!]), and an even closer relationship between 
the fundamental group and topology preservation has been established within 
digital surfaces ([9]). 

For these reasons, we would obtain a very powerfull tool for pattern recog- 
nition if we could make the information of the fundamental group accessible to 
computers in the 3D case. An even more difficult problem is to find an algorithm 
to determine whether two given 3D discrete objects have isomorphic fundamen- 
tal groups. Such an algorithm would represent a significant step to determine 
whether two objects could be the same up to some “continuous deformation” . 

The fundamental group contains strictly more information than the first ho- 
mology group. However, whereas the first homology group is always accessible 
to computers, it is somehow difficult to transform the abstract notion of the 
fundamental group into some data which can be handled by computers. A clas- 
sical way to encode certain types of discrete groups is to find presentations of 
these groups. A presentation of a group is a model of the group up to isomor- 
phism, which is characterized by an integer and some words on an alphabet 
called relators. Such a data can easily be encoded in a computer. 

Since the problem of computing a presentation of the fundamental group 
seems difficult in the general case, we are lead to study it, at first, in simpler 
frameworks than the complete 3D case. Though the practical motivation comes 
from the 3D case, the study of some easier cases gives some ideas on how to 
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proceed in the general case. In [7] (see also [8]), the author studies the planar 
2D case. In this case, the fundamental group is shown to be isomorphic to a 
free group (i.e. there are no relators in this case). This is an ideal case since 
computations in the free groups are easy to perform, and the type of the group 
up to isomorphism is easily determined. In this paper, we study the case of 
digital surfaces. 

Digital surfaces of three dimensional objects have proved to be a fruitful 
model for visualization and analysis of the objects they represent ([2]), especially 
in the biomedical field. Efficient algorithms for extracting surfaces from volumes, 
and computing shape characteristics exist ([6]). Sometimes, the surface itself 
needs to be segmented since some particular points are defined on it. Then we 
obtain a subset X of the set of the surfels of the surface. In [9], we introduce 
two complementary notions of adjaceney between surfels on a digital surface, 
which can be used to study the topology of such subsets X of a digital surface, 
and we study the problem of topology preservation. In particular, a relationship 
is established between the fundamental group, and the relation between sets 
generated by sequential deletions of simple surfels. Here we consider the same 
model for the fundamental group. 

We apologize for the high number of definitions necessary to the statement of 
the results of this paper. After having set these definitions, since the statement 
of the results is very technical, we first give an intuitive and non formal idea of 
how to proceed in the continuous case. Finally, we construct a presentation of 
the fundamental group of any connected subset of a digital surface. Our proof 
is constructive and leads to an efficient algorithm to compute this presentation. 

1 Basic Definitions and Notations 

1.1 Groups, Normal Subgroups 

We consider a group (T, *) with Ir as unit element. Let h G F. A conjugate of 
h in r is an element of T of the form: g * h* g~^ with g G F. Now let FI he a 
subgroup of F, we say that H is normal in F if for any h G FI, any conjugate of 
h in F belongs to FI . 

Given FI a normal subgroup of F, we consider =h the relation on F defined 
by [g =H g'] [g' * g~^ G H], Since FI is a normal subgroup of F, this is an 

equivalence relation. We denote hy F/H the set of equivalence classes of elements 
of F under the relation =h, and by pn '■ F — > F/H the projection which to 
an element of F associates its equivalence class under =h- Now, if g =h g' and 
gi =H g'l, then since H is normal we have g* gi =h g' * 9i - Hence the product * 
defines an operation, which we also denote by *, on F/H. To pnid) and Pnid'), 
this operation associates pnig * g')- Obviously, the element p_y(lr) of F /H is a 
unit element for the operation * on T / H . Furthermore, given g G F, the element 
Ph{9~^) of F/H is an inverse for pnig)- Therefore, (F/H,*) is a group which 
we call quotient group of F by the normal subgroup H . Intuitively, all elements 
of H are collapsed with the unit element to obtain the quotient group. 
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Now let P be any subset of P. We consider the subset H of P composed of 
all products of conjugates of elements of P and inverses of elements of P. Then 
H is a, normal subgroup of P which we call normal subgroup of P generated by 
P. The normal subgroup generated by P is the smallest normal subgroup of P 
which contains P. 



1.2 Free Group, Generators and Relations 

Now, before to introduce groups given by generators and relations, we must 
introduce the notion of the (non abelian) free group with m generators. Let 
{oi, . . . , Om} U . . . , a“^} be an alphabet with 2m distinct letters, and let 

Wm be the set of the all words over this alphabet (i.e. finite sequences of letters 
of the alphabet). We say that two words w € Wm and w' G Wm are the same up 
to an elementary simplification if, either w can be obtained from w' by inserting 
in w' a sequence of the form aia~^ or a sequence of the form a~^ai with i G 
{1, . . . ,m}, or w' can be obtained from w by inserting in w a sequence of the 
form or a sequence of the form af^Oi with i S {1, . . . , m}. Now, two words 

w G Wm and w' G Wm are said to be free equivalent if there is a finite sequence 
w = wi, . ■ ■ ,Wk = w' of words of Wm such that for i = 2, . . . ,k the word 
Wi-i and Wi are the same up to an elementary simplification. This defines an 
equivalence relation on Wm, and we denote by the set of equivalence classes 
of this equivalence relation. If w G Wm, we denote by w the class of w under the 
free equivalence relation. The concatenation of words defines an operation on 
Tn which provides with a group structure. The group thus defined is called 
the free group with m generators. 

We denote by Im the unit element of Pm, which is equal to w where w is 
the empty word. The only result which we shall admit on the free group is the 
classical result that if a word w G Ln is such that w = Im and w is not the 
empty word, then there exists in w two successive letters aia~^ or a~^Qi with 
i G {!,... , m}. This remark leads to an immediate algorithm to decide whether 
a word w G Ln is such that w = Im- If w = a")) ■ • • is a word of Wm, we 

denote by w~^ the word ■ ■ ■ a))^^ . We have: 

Now we introduce the groups given by generators and relations. We consider 
TO G N* and a finite subset TZ of Wm- In this context, we call elements of TZ 
relators. We want to define a group from the free group with to generators, in 
which the words of TZ represent the unit element. This group, called the group 
with TO generators and the relations of TZ, is the quotient of the free group Pm 
by the normal subgroup generated by the set of the equivalence classes in Pm of 
elements of TZ. Now, given 77 a group, finding a presentation of 77 is finding an 
(explicit) isomorphism from II to a group given by generators and relations. 



1.3 Gonnectedness in Digital Spaces 

Let 77 be a fixed set and let X C 77. We denote by card (77) the number of 
elements of X and we denote X = 77 \ 77. In the following, we shall define 
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an adjacency relation a on X to be an antireflexive symmetric binary relation 
on X. An a-path with a length p is a sequence (ccq, . . . ,Xp) in which Xi-i is 
a-adjacent or equal to Xi for i = 1, ... ,p. Such an a-path is called closed if 
and only if xq = Xn and is called simple if the points Xi for f S {0 . . .p} are 
pairwise distinct. Given an a— path c = (xq,... ,Xp), we denote by c~^ the 
reversed a— path (xp, . . . , xq). Two elements x and y are said to be a-connected 
in X if there exists an a-path {xq, . . . , Xp) in X with xq = x and Xp = y. The 
a— connectedness relation is an equivalence relation and we call a— connected 
components its equivalence classes. A set is a— adjacent to an element x if at least 
one element of the set is a— adjacent to x. We also define the a— neighborhood 
Na{x) of X by Na{x) = {y G S / y is a — adjacent to x}. 

1.4 Structure of a Digital Jordan Surface 

We describe here the model of surface we consider, and some local structures 
which can be defined on such a surface. Afterwards, we do not consider anymore 
the volume from which the surface is build, and we give some intrinsic definitions 
and results on subsets of a surface. 

First we recall some definitions, which can be found for example in [3] or 
[10], restricted to the three dimensional case. In the following, voxels may be 
seen as units cubes rather than points of Z^. We consider two kinds of adjacency 
between voxels. Two voxels are said to be 18— adjacent if they share a face or 
an edge. They are said to be Q— adjacent if they share a face. A surf el is a pair 
(c, d) of 6— adjacent voxels. It can be seen as a unit square shared by c and d. A 
surface is a set of surfels. 

Let O C be a 6— connected or 18— connected set. We consider S the set 
of all surfels of the form (x, p) with x G O and y G O: we call E the surface 
of O. it is possible to define an adjacency relation between surfels of E. Such 
a surfel has exactly 4 neighbors, one per edge under this relation. We call this 
adjacency relation the e— adjacency relation on the surface of O. Let s and s' 
be two e— adjacent surfels of E. The surfels s and s' share an edge. The pair 
{s, s'} is called an edgel. The definition of the e— adjacency relation depends on 
whether we consider O as 18— connected or 6— connected. 

The kind of surface thus defined satisfies the Jordan property ([3]): a surface 
separates the space into two parts, one of which is 6— connected, and the other 
one which is 18— connected. This kind of surface is widely used in image analysis 
and manipulation. 

We define a loop in A as an e— connected component of the set of the surfels 
of E which share a vertex w. For example, in Figure l.b, we see an object with 
three voxels. The vertex w marked with a filled circle defines two loops, one 
which can be seen on the figure and is composed of 3 surfels, and the other 
loop which is hidden and is composed of 6 surfels. Considering loops is a way to 
duplicate formally such vertices. We can define a unique cycle in any loop 1: from 
a pair (si,S2) of e— adjacent surfels in I choose S3 the unique surfel of I which 
is e— adjacent to S2 and which is distinct from si. By repeating this process we 
obtain a unique simple closed e— path of surfels which we call a parametrization 




140 Remy Malgouyres 




Fig. 1. Counter example and example 



of the loop 1. The length of a loop ranges between 3 and 7. All the surfels of a loop 
share a common vertex. For this reason, we say that two surfels are v— adjacent 
(vertex adjacent) if they belong to a common loop. 

Note that in the case of a planar surface orthogonal to one of the coordinate 
lines (i.e. in the case when the object O is a half space), w— adjacency coincides 
with the classical 2D digital image notion of 8— adjacency and e— adjacency to 
the classical 4— adjacency. 

Definition 1 (d— cell). We associate a dimension to surfels, edgels, and loops, 
which is equal respectively to 2, 1, and 0. We can identify a surfel s with {s}. 
We call a surfel a 2-cell, an edgel a 1-cell, and a loop a 0-cell. 

This dimension is compatible with the continuous analog of the digital sur- 
face. If a surfel is a member of a loop or of an edgel, we also say that it is incident 
to this loop or this edgel. Moreover, whenever an edgel is a subset of a loop, we 
also say that it is incident to this loop. We see that each surfel is incident to 4 
loops and 4 edgels, and each edgel is incident to 2 loops. 

In the sequel, we take the assumption that each loop of the surface of our 
object O is a topological disk. More precisely, we assume that any two v— adjacent 
surfels which are not e— adjacent cannot both belong simultaneously to two 
given distinct loops. For instance, in Figure 1, the object (a) does not satisfy 
this hypothesis: The two loops corresponding to the vertices marked with filled 
circles contain two non e— adjacent surfels in common. In opposite, the object 
(b) satisfies our hypothesis. We can express this assumption on the object O 
the surface of which we consider, saying that we assume that if O is considered 
as 18— connected and x and y are two 18— adjacent voxels of O which are not 
6— adjacent, one of the two following properties is satisfied: 

1. The voxels x and y have an 18— neighbor (or 6— neighbor) in O in common; 

2. The voxels x and y have two 26— neighbors in O in common which are 
themselves 26— adjacent. 

We must have the same assumptions on O if O is considered as 6— connected. 

In the sequel of this paper, we consider S a fixed e— connected component of 
a digital surface, and n G {e, u}. We also denote by n the element of {e, u} such 
that {n,n} = {e,v}. 
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2 Fundamental Groups, Topology Preservation 

2.1 Simple Surfels, Homotopy 

Let a; G If . As we have already said, we assume that any loop in 17 is a topological 
disk. However, the u— neighborhood of the surfel x is not always a topological 
disk (see the u— neighborhood of the grey surfel of the surface of Figure l.b for 
instance). If this is the case, we have to define a topology on Ny{x) U {a;} under 
which it is a topological disk. Let us consider two surfels y and y' in 7V„(a;) U{a:}. 
We say that y and y' are Cx — adjacent [respectively Vx—adjacent] if they are 
e— adjacent [respectively u— adjacent] and are contained in a common loop which 
contains x. We denote by Ge{x,X) [respectively Gy{x,X)] the graph whose 
vertices are the surfels of Ny{x) n A and whose edges are pairs of Ca,— adjacent 
[respectively Wa,— adjacent] surfels of Ny{x) n X. We denote by Cj^{Gn{x, A)) the 
set of all connected components of G„(x,A) which are n— adjacent to x. Note 
that C“(G„(a;, A)) is a set of subsets of the set of all surfels of A and not a set 
of surfels. 

Definition 2. We call x an n— isolated surfel if Nn{x)r\X = 0 and an n— interior 
surfel if Nn{x) n A = 0. 

Definition 3 (Simple surfel). A surfel x is called n— simple in X if and only 
if the number cdxd{Cf{Gn{x,X))) of connected components o/G„(x,A) which 
are n— adjacent to x is equal to 1, and if x is not interior to X. 

Definition 4 (homotopy). Let hcY C A C A. The set Y is said to be (lower) 
n— homotopic to X if and only ifY can be obtained from X by sequential deletion 
of n— simple surfels. 

2.2 The Digital Fundamental Group 

First, if a and /3 are two n— paths such that the last surfel of a is n— adjacent 
of equal to the first surfel of (3, we denote by a* /3 the concatenation of the two 
n— paths a and (3. 

Now we need to introduce the n— homotopy relation between n— paths. Let 
us consider A C A. First we introduce the notion of an elementary deformation. 
Two closed n— paths tt and tt' in A having the same extremities are said to be 
the same up to an elementary deformation (with fixed extremities) in X if they 
are of the form tt = tti * 7 * 7T2 and tt' = tti * 7' * 7T2, the n— paths 7 and 7' 
having the same extremities and being both contained in a common loop. Now, 
the two n— paths tt and tt' are said to be n— homotopic (with fixed extremities) 
in X if there exists a finite sequence of n— paths tt = TTg, . . . , = tt' such that 

for i = 1 , . . . ,m the n— paths tt^-i and are the same up to an elementary 
deformation (with fixed extremities). 

Let A G A be a fixed surfel called the base surfel. We denote by Ag{X) 
the set of all closed n— paths tt = (xg, . . . ,Xp) which are contained in A and 
such that Xq = Xp = B. The n— homotopy relation is an equivalence relation on 
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A^{X), and we denote by 7T"(X) the set of equivalence classes of this equivalence 
relation. If c € A'^{X) is a closed n— path, we denote by [c] G 7T”(X) the class 
of c for this relation. 

The concatenation of closed n— paths is compatible with the n— homotopy 
relation, hence it defines an operation on 7T"(X), which to the class of a and 
the class of /? associates the class of a * /3. This operation provides i7”(X) 
with a group structure. We call this group the n— fundamental group of X. The 
n— fundamental group defined using a surfel B' as base surfel is isomorphic to the 
n— fundamental group defined using a surfel B as base surfel if X is n— connected. 

Now we consider Y G X G X and B gY a, base surfel. A closed n— path in Y 
is a particular case of a closed n— path in X. Furthermore, if two closed n— paths 
of Y are n— homotopic (with fixed extremities) in Y, they are n— homotopic (with 
fixed extremities) in X. These two properties enable us to define a canonical 
morphism : nf{Y) — > 77"(A), which we call the morphism induced by the 
inclusion map i : Y — > X. To the class of a closed n— path a G A^{Y) in 
nf{Y) the morphism z* associates the class of the same n— path in ]Jf{X). 
The following is proved in [9]: 

Theorem 1. Let Y G X G X be n— connected sets. Then the two following 
properties are equivalent: 

1. The set Y is lower n— homotopic to X . 

2. The morphism z* : IIf(Y) — > IIf{X) induced by the inclusion map i : 
Y — > X is an isomorphism and each ri— connected component ofY contains 
a surfel of X. 



2.3 Homotopy in Subgraphs of n— adjacency Graphs 

Let G be a subgraph of the n— adjacency graph of a set A G A with the same set 
of vertices. Let tt and tt' be two paths in the graph G. We say that tt and tt' are 
homotopic in the subgraph G if there exists a finite sequence tt = ttq, . . . , TTm = 
of paths in G such that for z = 1, . . . ,m the paths 7Ti_i and tt*, as rz— paths, 
are the same up to an elementary deformation (with fixed extremities). The 
subgraph G of the n— adjacency graph of X is called simply connected if any 
path in G is homotopic in G to a constant path of the form {B, B) with B G X. 



3 About the Continuous Case 

Let us first recall some definitions about G^ surfaces embedded in R^. We denote 
by D the opened disk of with a unit radius and centered at (0, 0). 

A continuous map gr : D — > R^ is called G^ if all the partial derivatives of 
ip exist at any point (s,t) G R^, and are continuous functions of (s,t)- 
Let ip : D — > R^ be a G^ map. 

We denote v?(s, t) = {p\{s, t), P 2 {s, f),p^{s, t)) G R^. 
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The map (f> is called regular if for any (s, t) € D the two vectors 

are both non zero and are linearly independent in R^. 

Given two metric spaces X and Y, a continuous map / : X — > Y is called 
a homeomorphism if it is one to one and its inverse map is continuous. 

Definition 5 (C^ surfaces, local parametrization). A subset X of is 
ealled a surface embedded in (or surface for short) if for any x G X 
the two following properties are satisfied : 

1. There exists an opened neighborhood of x in R^ (we denote by the 
neighborhood Wx H X of x in X) ; 

2. There exists a homeomorphism ipx '■ D — > Vx which is and regular and 

with 0)) = X. 

Such a map (px is called a local parametrization of X in the neighborhood of x. 

We have to introduce the notion of an oriented surface. Under the notions of 
previous Definition 5, we first consider, for x,x' G X such that YxClYx' yf 0, the 
map 

r 5 ""' : v^f^Vx nVx^)^ g^^^iVx n Vx-) 

1 (sU) ' — ^ oipx{s,f) 



We also denote g^^'{s,t) = {gf^' (s,t), g^^' (s,t)) G R^. Then the two vectors 

(s,t), (s,t)) and (s,t), (s,t)) of R^ exist and form a basis 

of R^ for any (s,t). An oriented surface is a couple (A, {p-x)x£x) satisfying all 
the above properties, and such that for any x,x' G X such that 14 n 14' yf 0 and 
for any (s,t) G (pf^(Vx fl 14'), the basis 






of R^ is direct (i.e. has a positive orientation). Such a structure exists for any 
surface X embedded in R^. 

Let X he a oriented surface. A path c in A is a continuous map c : 
[0, 1] — > A. Such a path c is called closed if its two extremities c(0) and c(l) are 
equal. A curve is a path which, as a map, admits a nowhere zero continuous 
derivative. A simple curve is a curve c such that Vo,/? G [0, 1] [c{a) = c{(i) 4=^ 
a = j3]. X simple closed curve is a curve c such that Vo;,/3 G [0,1] [c{a) = 
c(/3) (a = ^ or {a, /3} = {0, 1})]. 

Let us consider two curves c\ and C 2 in A which have finitely many intersec- 
tions, and such that the directions tangent to c\ and ci are distinct at any of their 
intersections, and let a and j3 in [0, 1] be such that ci(a) = C2(/3) (in other words 
we consider an intersection between ci and C 2 ). We denote x = Ci{a) = C2(/3) the 

intersection point. We consider the two following vectors of R^: v = (q^) 

and w = (/?)■ We define the orientation number of the intersection 
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(a,/3) of Cl and C 2 , and we denote by 0(a, /3, Ci, C 2 ), the number equal to 1 if 
the basis {v, w) has a positive orientation in R^, and equal to —1 otherwise. This 
number depends on the oriented surface {X, {(fix)xex) and not only on the 
surface X. 

Now we introduce in an informal way the fundamental group of a connected 
subset X of R^. We consider B G X a, fixed point called the base point, and 
Ab{X) the set of all closed paths c in X such that c(0) = c(l) = B. We 
introduce on Ag{X) an equivalence relation of homotopy between paths which 
intuitively represents the relation of being continuously deformable one into each 
other inside X (with fixed extremities). Then, similarly to the discrete case, we 
denote by Bi{X, B) the set of the equivalence classes of paths up to homotopy in 
Ab{X), and the concatenation of paths defines an operation on IIi{X, B). This 
operation provides IIi{X, B) with a group structure, and this group is called the 
fundamental group of X. A set X is called simply connected if its fundamen- 
tal group reduces to a singleton (i.e. any path of Ab{X) can be continuously 
deformed to a point inside X. 

Now we can explain how to construct a presentation of the fundamental 
group of an oriented surface {X,{ipx)x^x)- First we have to construct a finite 
set {tti, . . . , TTm}, where for any z G { 1 , . . . , m} the element is either a simple 
curve or a simple closed curve, satisfying all the following properties : 

(Pi): Vz,j G {1, . . . ,m} with z yf j and Va,/3 g] 0, 1[ we have 7Ti(a) yf Xj(/3) (in 
other words the curves can intersect only at their extremities) ; 

(P 2 ): Any curve tt^ which is not simple closed must intersect at least another of 
the TTj’s at each of its extremities ; 

(P 3 ): The complement A\{ 7 Ti(a) j i = 1, ... ,m and a G [0, 1]} of the images 
of all the curves is connected and simply connected. 





Fig. 2. Example with a torus. 



Fig. 3. An intersection point. 



Let us consider for example the torus represented by Figure 2. We have 
represented two (simple closed) curves tti = a, and 7T2 = b. All the extremities 
of a and b are at the same point x. The complement in the torus of these two 
curves is a topological disk. 

Now let us consider the surface of genus 2 depicted by Figure 4. Here we 
have 6 curves a, b, c, d, e and /. The curves c and e are simple closed and the 
other curves are not closed. The complements of these 6 curves in the surface 
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is a topological disk. There are three points at which we have intersections of 
curves. 

Now let us introduce the presentation we have for the fundamental group of 
X. We consider a base point B in the complement of the images of the curves 
TTj. We take m generators {oi, . . . ,am}, one for each of the curves tt^. Let us 
describe the words we take as relators : For any extremity 7rj(a) with a G {0, 1} 
of a curve with i G {1, • . • ,m}, we consider the word R{TTi{a)) in the Oj’s 
and corresponding to the curves nj which appear cyclically around the 

extremity ni{a) (see Figure 3). The order in which the a/s and appear in 

the word R(TTi(a)) is precisely the cyclic order of the curves iTj which meet at the 
point ni{a). Now, whether we get in the word R{ni{a)) an aj or an aj^ depends 
on the orientation of the curve ttj, i.e. depends on whether 7Ti(a) = 7Tj(0) or 

7Ti(a) = 7Tj(l). 

For the example of the torus depicted by Figure 2, we have two generators 
a and b. The TTj’s all have the same extremity x, and the word R{x) is up to 
conjugacy the word aba~^b~^, commutator of a and b. 

For the example of the surface of genus 2 represented by Figure 4, we have 
three words R{xi), R{x 2 ) and R{x^), corresponding to the three intersection 
points Xi, X 2 and x^ from the left to the right on the figure. We have (up to 
conjugacy) R{xi) = acb~^c~^, R{x 2 ) = da~^f~^b, and R{x 3 ) = ed~^e~^f. 

Now we consider TZ the set of all words R{ni{a)) for f = 1, . . . , m and a G 
{0, 1}. We denote by T the group with m generators and the relations of TZ. The 
fact is that the fundamental group of the surface X is isomorphic to R. Let 
us explain how the isomorphism is defined : 

Let us consider a closed path c of Ab{X). First we want to construct a word 
w in the generators. To do this, we first chose a path c' , which is a curve, can be 
continuously deformed into c inside X, and which has a finite intersection with 
all the curves tt^, i = 1, . . . ,m (such a curve always exists). Moreover, we may 
assume that at each intersection between c' and some tt^, the tangent directions 
to c' and are not parallel at this intersection. 

Then, to construct the desired word w{d), we go over the curve c' , adding to 
the word we construct a symbol '> each time we have an intersection 

7Ti(a) = c'{P). The order in which the symbols appear in the word w{d) is the 
order in which the intersections with the tt^’s appear along c'. Then we have : 
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Theorem 2. The projection of w{d) € Tm on the group T depends only on 
the class of the path c in the fundamental group. We denote this class by V\i{c), 
which is an element of T. 

Theorem 3 (Presentation of the fundamental group). If we denote by 
[c] the class of a path c of Ab(H) in the fundamental group, then the map 
[c] I — > yV(c) is well defined, and it is a group isomorphism from IIi{X,B) onto 
the group T. 

For example, for the torus depicted by Figure 2, the fundamental group has 
two generators a and b with the single relation aba~^b~^, and the fundamental 
group is therefore isomorphic to the group (Z^, +). For the example of Figure 4, 
we have 6 generators and 3 relators as we noticed above. 

In fact, in the continuous case, it is always possible to construct the curves 
TTj’s in such a way that all of their extremities are at the same point. In this case, 
the presentation we obtain for the fundamental group has a single relation. We 
shall see that it does not work the same in the discrete case. 

4 The Discrete Case 

4.1 Simple Curves on a Digital Surface 

We remind the reader that we have defined a digital surface S, and n G {e,v}. 
In the sequel we shall assume that n = e and n = u. In other words, we shall 
analyze a fixed e— connected subset X of the set of the surfels of X with the 
e— connectivity relation, and the complement X oi X in X with the u— connec- 
tivity relation. Our purpose is to compute a presentation of the e— fundamental 
group of X. We first want to define a set {tti,... ,TTm} of curves satisfying 
properties analogous to the three properties (Pi), (P 2 ) and (P 3 ) of the continuous 
case. The kind of curves which are to be considered here have a thickness 0. 

Definition 6 (Discrete simple curve on a surface). A oriented simple curve 
on the surface X is a sequence it = (ci, . . . , Cg), with q > 1, where Cg = (r^, Ig) 
is for s = 1, . . . ,q a couple of surfels with rg e— adjacent to Ig, satisfying the 
three following properties : 

1. For s = 1,... ,<7—1, the surfels rg,lg,rg+i and Ig+i are all contained in a 
unique common loop Lt^{s). For convenience, if q = 1 we chose L,r(0) and 
L^(l) two distinct loops which contain r\ and l\. Ifq >2 we denote by T,r( 0 ) 
the unique loop which contains r\ and l\ and is distinct from P,r(l); o,nd we 
denote by LT^{q) the unique loop which contains rq and Iq and is distinct from 
Lniq - 1 ) ; 

2. For s = 1, . . . ,q — 1, the surfels Ig and are e— connected in 

L^(s)\{rs, Ts+i} and the surfels Vg and r^+i are e— connected in 
^Tr{s)\{lg,lg+l} ,' 

3. The loops Lt^{s) for s = 0, . . . ,q are all distinct. 
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Definition 7 (oriented simple closed curve). An oriented simple closed 
curve on S is a sequence such as defined by the previous definition of oriented 
simple curves, but satisfying the following property 3’ instead of the property 3 : 

3’ For s,t € {0, ... ,q} we have [L7t(s) = {s = t or {s, t} = {0, g})] . 

Definition 8. Let tt = (ei, . . . , e,) and tt' = (e(, . . . , e'^,) be two oriented simple 
curves or simple closed curves on E. We say that the curves tt and tt' have no 
interior intersection if and only if for s G {0, . . . ,q} and for t G {0, . . . , q'} with 
s ^ {0, g} or s' ^ {0, g'} we have : Lt^{s) LT^fit). 

4.2 Construction of an Appropriate set of Curves 

As we said before, we want to proceed in the discrete case as we did for the 
continuous case, constructing a set {tti,... ,TTm} of simple curves and simple 
closed curves satisfying properties analogous to the properties (Pi), (P2) and 
(P3) of the continuous case. However, in the discrete case we shall not only 
compute a presentation of the fundamental group of S, but we shall do it for 
any e— connected subset X of E. Hence, in the sequel of this paper, X denotes 
a set of surfels of E, and P G A is a fixed base surfel. We want to construct 
a set {tti, . . . , 7Tm} of simple curves and simple closed curves (we denote = 
(e(,... ,e*J for i = 1,... ,m and e( = (r*,l()), on E satisfying the following 
properties : 

(Pi): Vi,j G {!,... ,m} with i j, the curves tt^ and Tij have no interior 
intersection ; 

(P2): For any i G {!,... ,m} and any s G {0,(71}, there exists j G {!,... ,m} 
and t G {0, qj}, with i j or s t, such that ; 

(P3): The subgraph of the e— adjacency graph of X, obtained by removing in 
the e— adjacency graph of X all the edges of the form {ll,rl}, is connected 
and simply connected. 

To construct such a set of curves, we first construct a connected and simply 
connected subgraph of the e— adjacency graph of X, having X as set of vertices. 
A way to do this is to consider a covering tree G{X) in the e— adjacency graph of 
X. Such a covering tree is a connected subgraph having X as set of vertices, and 
with no cycle so that it is simply connected. Efficient algorithms to construct 
covering trees in connected nonoriented graphs exist and can be found in most 
of books on elementary algorithms. 

We can consider the set A of the pairs |a;i,X2} of e— adjacent surfels of X, 
such that |xi,a;2} is not an edge of the subgraph G(X). As we shall see, from 
the data of the elements of A, we can construct a set of oriented curves satisfying 
the properties (Pi) and (since G(X) is connected and simply connected) (P3). 

Definition 9. Let Ad be a set of pairs of e— adjacent surfels of X. An edge 
a G A! , such that there is a loop I included in X which contains a, and such that 
for any <j' G A' with a' a we have o' (f I, is called an extremity of A . 




148 Remy Malgouyres 



Because of the existence of extremities in the set A, the set of oriented curves 
we can construct from the data of A does not always satisfy the property (^2)- 
What we can do is remove iteratively extremities in A until we get a set which 
contains no more extremities. More precisely, we set Aq = A, we iteratively 
choose an extremity ak in the set Ak and set A^+i = ^fc\{CTfc}, until we get 
a set Af which contains no extremity. For k = 0, . . . , /, we denote by Gk{X) 
the graph whose vertices are the surfels of X, an edge of Gk{X) being either 
an edge of G(-^), or an edge of A\Ak- The graph Gk{X) is a subgraph of the 
e— adjacency graph of X. We have the following lemma : 

Lemma 1. For k = 0,... , /, the graph Gk{X) is connected and simply con- 
nected. 

There remains to construct the oriented curves tt^’s from the data of Af. 
Let us consider a = {ri,li} an edge of Aj. We denote Ci = (riJi). Then the 
sequence (ei) with a single term is a simple curve on S. Hence we can construct 
a set of simple curves and simple closed curves {tti, . . . , tt™} such that, denoting 
= (ej,... ,e*J for i = 1 ,... ,m and e* = (r*,Z*), the map <F : 1 — > 

{r® , l\} is a one to one correspondence between the set {e® / i = 1 , . . . ,m and 
s = 1, . . . ,Qi} and the set Af. We can for instance consider curves reduced to one 
edge, but it is also possible to consider longer curves, by extending the curves 
in loops containing exactly two edges of Af, so that we get a lower value of m. 
By constructing the simple curves and the simple closed curves tt^ in this way, 
the TTi’s satisfy the property (Pi). Moreover, since, from its very construction, 
Af has no extremity, the tt^’s satisfy the property (^2)- At last, from Lemma 1, 
the TTi’s satisfy the property (P3). 

4.3 Main Results 

We consider {tti, . . . , Tr^} a set of curves and simple closed curves satisfying the 
properties (Pi), (P2) and (P3) defined above. It follows from Subsection 4.2 that 
such a set of curves can always be constructed. We denote tti = (ej, . . . , e® J and 
e® = (r® , I® ) for i = 1 , . . . , to and s = 1, . . . ,Qi. 

As when we have defined the free group Tm, we consider the alphabet 

{oi , . . . , Ctrri: ^i j ■ ■ ■ 7 } 

with 2 to letters, and Wm the set of all word on this alphabet. 

Let c = (xq,... ,Xp) be any e— path in X. We want to construct a word 
w{c) G yVm associated with c. 

Definition 10. Let fee {0, . . . ,p — 1}, i G {!,... , to} and s € {1, ■ . ■ , Qi}, be 
such that {rl,ll} = {xfe,a;fc+i}. The triple (k,i,s) is called an intersection of c 
and the ttj’s. We say that this intersection is positive if {r\,l],) = {xk,Xk+i), 
and negative if {ll,r\) = {xk,Xk+i). 

Notation: Given (k,i,s) an intersection of c and the tt^’s. We denote by 
Oc{k,i,s) the number equal to +1 if the intersection (k,i,s) is positive, and to 
— 1 otherwise. 
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We define w{c) as the word, containing one symbol for each inter- 

section (fc,i,s) of c with the tt^’s, these symbols appearing in the word w{c) in 
the order of the increasing k (i.e. the order in which the intersections appear 
along c). We denote by w{c) the class of w{c) up to elementary simplifications 
(see the definition of the free group) . 

Now we can define a group F given by generators and relations. Let us 
consider a loop with f G {1, . . . , m} and s G {0, qt} (which is an extremity 

of the curve iTi). we assume that the loop is contained in X. As we 

observed when defining loops, the loop admits a parametrization, which 

is a closed e— path covering We denote by c(i, s) such a parametrization. 

We denote by R{i, s) the word w{c{i, s)), and by TZ the set of all words R{i, s) 
for all f G {1, ... , m} and s G {0, qi} such that the loop (s) is contained in 
X. Note that, since the parametrization c(f,s) of the loop is not exactly 

unique, but depends on an initial surfel and the orientation of a rotation in 
the word R{i, s) is only defined up to cyclic permutation or inversion in 
the free group. The normal subgroup of Fm generated by TZ does not depend 
on this choice of the parametrization c{i, s) since two cyclic permutations of the 
same word correspond to conjugate elements of Fm- 

We denote by FI the normal subgroup of the free group Fm generated by the 
words of TZ. As in the definition of the quotient group, we denote by =h the 
relation of equality modulo elements of FT in Fm- Finally, we denote F = Fmj FT 
the quotient group, and we denote by pu : Fm — F the projection. The group 
F is the group with m generators and the relations of TZ. We want to prove that 
the e— fundamental group of X is isomorphic to the group F. More precisely, we 
have the two following results : 

Theorem 4. The map pn ° w : A‘^{X) — > F is constant on each equivalence 
class of Ag{X) for the e—homotopy relation in X. Therefore, this map pn ° w 
induces a map 

(p:n!{X)^F 

1 [ c ] I — >ph{w{c)) 

Obviously, since concatenation of paths corresponds to concatenation of words, 
this map p is a group morphism. 

Theorem 5. The map p : TTf(X) — > F is a group isomorphism. 

The idea is that if we consider (fc, i, s) with s G {0, qi} an intersection of an 
e— path c in A and the TTj’s, and if c(i, s) denotes a parametrization of the loop 
L-Ki{s), then the closed e— path c(i, s), since contained in a loop, is e— homotopic 
in X with fixed extremities to a path reduced to a single surfel. In other words, 
the element of the e— fundamental group of X represented the closed e— path 
c{i, s) is the unit element of the e— fundamental group. Through the isomorphism 
p from TTf{X) onto F = Fm/H, this is translated into the fact that the element 
of Fm represented by the word R{i, s) = w{c{i, s)) is sent onto the unit element 
of F. Therefore, R{i, s) must belong to FT, but this is precisely the definition of 
H. 
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Conclusion 

We can compute a presentation for the fundamental group of any subset of 
a digital surface. Some principles presented here can be reused to study the 
complete 3D case. However, the ideas of this paper must be widely modified to 
be used in the 3D case. Mainly, it seems that, in the 3D case, the curves tt^’s 
which we construct must be surfaces in the 3D space. Of course, the junctions 
between surfaces are much more difficult to characterize than intersections of 
curves on a surface. For this reason, the 3D case seems much more difficult than 
the case of surfaces. 
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Abstract. The problem of reconstruction of two-dimensional discrete 
sets from their two projections is considered in different classes. The re- 
construction algorithms and complexity results are summarized in the 
case of hu-convex sets, hw-convex polyominoes, hu-convex 8-connected 
sets, and directed h-convex sets. We show that the reconstruction al- 
gorithms used in the class of hu-convex 4-connected sets (polyominoes) 
can be nsed, with small modihcations, for reconstructing /lu-convex 8- 
connected sets. Finally, it is shown that the directed h-convex sets are 
uniquely reconstructible with respect to the row and column sum vectors. 



1 Introduction 

One of the most frequently studied reconstruction problems is the construction of 
2-dimensional discrete sets from their row and column sum vectors. This kind of 
questions can arise, for example, in electron microscopy [9], image processing [18] 
and radiology [16]. The general problem of reconstructing binary matrices (or, 
equivalently, discrete sets) has been extensively studied. Existence, consistence 
and uniqueness results are known (as a summary see [3]) and reconstruction 
algorithms are published (e.g., [6,17]). Since, in general, the number of solutions 
can be very high, there is an interest to reconstruct binary matrices/discrete 
sets having special properties. Kuba published an algorithm to reconstruct so- 
called two-directionally connected discrete sets [15]. Del Lungo et al. studied 
the reconstruction of different kinds of polyominoes [11,1]. Recently, Chrobak 
and Diirr found reconstruction algorithms for special polyominoes. Since the 
reconstruction in certain classes can be too complex (see [19]), the classes, where 
the reconstruction can be performed in polynomial time, are the most important 
for the applications. 

This paper deals with the reconstruction problem in different classes of dis- 
crete sets. The most frequently used property is some kind of discrete version of 
the convexity. For example, we can suppose that the elements of the discrete sets 
in each row and column are consecutive. Since this property itself is not enough 
to have a reconstruction problem which is solvable in polynomial time (see [19]), 
further properties can be included (e.g., connectedness). From the literature five 
properties are collected (/i-convex, u-convex, 4-connected, 8-connected, and di- 
rected discrete sets) and the classes constructed from these combinations are 
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studied from the viewpoint of reconstruction. As a result of this survey study, 
we have found classes where known algorithm, after small modifications, can be 
used for reconstructing discrete sets of other classes. Also, a new reconstruction 
algorithm is given. 

First, the necessary definitions and notations are introduced. Then in the 
next sections the reconstruction problem in the following classes are discussed: 
/lu-convex sets, hu-convex polyominoes, /iw-convex 8-connected sets, and directed 
/i-convex sets. 



2 Definitions and notations 

Let 2Z^ denote the 2-dimensional integer lattice. In this paper the elements of 
2Z^ will be called points or positions. The finite subsets of will be called 
discrete sets. The class of discrete sets will be denoted by T . 

Let F be a discrete set. Then there is a discrete rectangle T of size mxn (m 
and n are positive integers), 

T= {!,... ,m} X {!,... ,n}, (1) 

such that T is the smallest discrete rectangle containing F. The discrete set F 
can also be represented as a binary matrix (fij)mxn, fij G {0, 1}, such that 



\ 0, otherwise. 

For any subset of T we define the i-th row of the subset, 1 < i < m, as its 
intersection with i x {!,..., n}. Similarly, the j-th column, 1 < j < n, of the 
subset is its intersection with {!,... ,m} x j. 

Let IN denote the set of positive integers. For any discrete set F we define 
its projections by the operations H and V as follows. H : F — > IN’”, 7f(F) = 
H = (hi, . . . , hm) where 



= i = l,...,m, (2) 

i=i 

and V : F — > IN", V(F) = V = (vi, . . . ,w„) where 

m 

= j = l,...,n. (3) 

i=l 

The vectors H and V will be called the row and column sum vectors of F, 
respectively. 

Let F and F' be discrete sets. We say that F and F' are tomographically 
equivalent (w.r.t. the row and column sum vectors) if 



n{F) = n{F') and V(F) = V(F'). 



( 4 ) 
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The discrete set F is unique (w.r.t. the row and column sum vectors) if there is 
no discrete set F' F) being tomographically equivalent to F. 

Let 5 be a class of discrete sets. The discrete set F G ^ is determined (by 
their projections) if there is no tomographically equivalent set in the class Q . 

We are going to study the following problem in different classes of discrete 
sets. 

Reconstruction(^). 

Instance: Two vectors FI G IM™ and V G IN". 

Task: Construct a discrete set F G G such that 'H(F) = FI and 

V(F) = V. 

Different classes of discrete sets will be introduced by the following defini- 
tions. 

Let P and Q be two points in Let us consider the Euclidean distance 
between P and Q, 



\\p-Q\\ = V{pi-qir + {p2-q2r. (5) 

The points P and Q are said to be ^-adjacent if |jP — <5|| < 1 and 8-adjacent if 
ll^*— <511 < (Explicitly, the 4-adjacent points of (i,j) are {i — 1, j), {i,j — 1), 
{i,j + 1) and {i + 1, j), i.e. the next points in directions north, west, east and 
south, respectively, and the point (i, j) itself.) The sequence of distinct points 
(io,jo), • • • , (*fc, jfe) is a 4-path/ 8 -path from point (io,jo) to point (ik,jk) in a 
discrete set F if each point of the sequence is in F and is 4-adjacent/8- 

adjacent, respectively, to (q_i,jf_i) for each I = Two points are 4- 

connected/8-connected in the discrete set F if there is a 4-path/ 8-path, respec- 
tively, in F between them. A discrete set F is /-connected/ 8-connected if any 
two points in F are /-connected/ 8-connected, respectively, in F. The 4-connected 
set is also called polyomino (see, for example, Golomb [12]). The classes of 4- 
connected and 8-connected sets will be denoted by (04) and (cg), respectively. 
From the definitions it follows that (04) C (cg). 

The discrete set F is horizontally convex (or, shortly, h-convex) if its rows 
are 4-connected. The class of /i-convex sets will be denoted by {h). Similarly, 
a discrete set F is vertically convex (or, shortly, v-convex) if its columns are 
4-connected. The class of w-convex sets will be denoted by (w). The h- and v- 
convex sets will be called hv-convex and their class will be denoted by (h,v). 
Clearly, (h,v) = (h) fl (u). Using this notation, the hv-convex polyominoes are 
the class of (04, h, v). 

The sequence of distinct points (tg, jo)) • • • > (*fc, Jfc) is an EN-path from point 
(*0)Jo) to point (ik,jk) in a discrete set F if each point of the sequence is in F 
and (ii,ji) is in east or north to (q_i, j;_i) for each I = 1, . . . ,k. The discrete 
set F is directed if there is a particular point of F, called source, such that there 
is an EN-path in F from the source to any other point of F. It follows from the 
definition that the source point of a directed set is necessarily the point (m, 1). 
The class of directed sets will be denoted by (d). Clearly, (d) C (04). 
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Fig. 1. Examples of discrete sets in different classes, a) fe-convex discret set; b) w-convex 
discrete set; c) 4-connected discrete set; d) 8-connected discrete set; f) directed discrete 
set. 



3 Reconstruction of h,t;-convex sets 

The reconstruction of hv-convex sets was studied by Kuba [15]. He gave a greedy 
type heuristic algorithm, by which the set F to be reconstructed could be approx- 
imated by two sequences of discrete sets, and {5'^*^}. The first sequence is 

nondecreasing and it consists of hv-convex sets supposed to be the subsets of F, 
called core sets. The other sequence is nonincreasing and its consists of discrete 
sets supposed to contain F, called envelope sets. As initial core and envelope 
sets we can use 



= 0 and = T. (6) 

The core set in each iteration will be constructed from the (previous) envelope 
set S and from the projections F[ and V as follows. First, we create the discrete 
set Kh{S,H), the intersection of all h-convex sets contained by S and having 
horizontal row sum FI . (It can be computed easily row by row. For example, 
if the z-th row of S consists of (z, s(), (z, s( -b 1), . . . , (z, s") (1 < s' < s" < n) 
and /zi < s" — s' -b 1 < 2 * /zi — 1 then the z-th row of Kh{S, H) is the set of 
{(z, s" — hi -b 1), (z, s'' — /zi-b2), . . . , (z, s' -b/zi — 1)}.) Similarly, using the envelope 
S and the column sums V the set Ky{S,V) can be defined. Clearly, if F C S' 
then Kh{S, FI) C F and Ky{S,V) C F. Therefore, we can select the intersection 
of all hv-convex sets containing Kh{S, H) U Ky{S, V) as the new core. (See Fig. 
2.) 

Analogously, the envelope set in each iteration will be constructed from the 
(previous) core set K and from the projections H and V as follows. First, we 
create the discrete set Sh{FC,H), the union of all h-convex sets containing K 
and having horizontal row sum H. (It can be computed easily row by row. 
For example, if the z-th row of K consists of (z, fc'), {i.k[ + 1), . . . , (z, k") (1 < 
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a 



b 



Fig. 2 • Construction of core and envelope sets, a) The construction of the core set 
from the envelope set S = T and from the projections. The elements of Kh{S,H) and 
K^[S, V) are denoted by horizontal and vertical thick line segments, respectively. The 
squares denote the new elements of the intersection of all hv-convex sets containing 
Kh{S, H) U Ky{S, V). b) The construction of the envelope set from the core set con- 
structed in a). The elements of Sh{K, H) and Sv{K, V) are denoted by horizontal and 
vertical line segments, respectively. 



k[ < k” < n) and k" — k[ + 1 < hi then the z-th row of Sh{K,H) is the set 
of {(z, k" — hi + 1), (z, k" — hi + 2),. . . , (z, k'i + hi + 1)}.) Similarly, using the 
core K and the column sums V the set Sv{K,V) can be defined. Clearly, if 
K C F then F C Sh{K, H) and F C Sv{K, V). Therefore, we can select the set 
Sh{K, H) C\ Sv{K,V) as the new envelope. 

During the iterations we have 

iCWcFCS'('). (7) 

If we have for some I then F = is a solution. It may also 

happen that during the iterations we reach a situation when core or envelope 
do not change or In the first case, we can take an arbitrary element 

of the set \ add it to the core and continue the algorithm with this 
new core set. In the second case we have a contradiction indicating that there 
is no solution between the current core and envelope. Then we can return to 
the arbitrary selected element (if we made such a formation trial earlier), take 
it away from the envelope set and continue the algorithm with this reduced 
envelope. 



4 Reconstruction of h,t;-convex polyominoes 

This class of discrete sets was studied first by Del Lungo et al. [11,1]. The 
method was improved [2,4] and the latest version is able to reconstruct /zv-convex 
polyominoes in time 0{nm ■ log(nm) • min{m^, rz^}). 

This algorithm uses also the iterative procedure applied for /zz;-convex sets, 
but it is basically different in two points: selection of the initial core and handling 
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the situation when the core and envelope can not be changed in the way described 
in Section 3. 

For the selection of the initial core the concepts of feet of polyominoes are 
introduced. The columns of F that have element in the first row of T are called 
the north foot of F, denoted by Pn- Analogous definitions can be given for the 
south, east, and west feet, Ps, Pe- and Pw, respectively. The algorithm starts 
with choosing the positions of the four feet. (There are at most 0(min{m^, n^}) 
possible cases. Some of them can be sorted out soon at the beginning of the 
reconstruction.) Then the /lu-convex polyomino containing the selected feet (as 
an initial core) is constructed. Using the iterative procedure described in Section 
3 the core set is increased and the envelope set is decreased. 

If the core and envelope sets can not be changed then the reconstruction 
is reformulated as a 2-Satisfiability problem (also referred to as 2SAT). Such 
situations can be described by a 2SAT expression, i.e., a Boolean expression in 
conjunctive normal form with at most two literals in each clause. The 2SAT 
expression is satisfiable if and only if there is an /iv-convex polyomino solution 
of the reconstruction problem. 

Recently, Chrobak and Diirr have found a new algorithm [8] reconstructing 
/lu-convex polyominoes in 0{mn ■ min{m^,n^}) time. The basic idea of their 
algorithm is to rewrite the whole reconstruction problem as a 2SAT problem 
(see Section 5). 

5 Reconstruction of h,t;-convex 8-connected sets 

This class of discrete sets was studied first by Brunetti et al. [4]. They showed 
that a modification of the algorithm suggested for reconstructing hu-convex 
polyominoes can be used even in this class. The complexity of the algorithm 
is 0{nm ■ log(nm) • in the worst case. 

In order to show the strong connections between the classes of hv-convex 
8-connected sets and hv-convex 4-connected sets (i.e., polyominoes), we present 
the corresponding lemmas for both classes. 

Lemma 1. Let F be a hv-convex polyomino with row and column sums H and 
U. If 






i-1 



i-l 



^vi>^hk, ^hk> ^ vi, (8) 


















i^j+i 



1^3 






then {i,j) G F. 
Proof. See [2]. 



Lemma 2. Let F be a hv-convex 8-connected set with row and column sums H 
and V . If 
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j i—1 i i Ti n i—1 

'^hk>'^vi, '^hk> vi, '^vi>'^hk (9) 

k—l k—l 1 — 1 k—1 l-j~\-l l—j k—1 

then (i,j) G F. 

Proof. See [4]. 

In this section we show that the modification of the algorithm suggested by 
Chrobak and Diirr [8] for reconstructing ft,u-convex polyominoes can be used in 
the more general class of {cs,h,v). They showed that the reconstruction of a 
/lu-convex polyomino was equivalent to the evaluation of a suitable constructed 
2SAT expression. Now, we give the description of the modified algorithm follow- 
ing the same idea as Chrobak and Diirr. Since the statements can be proved in 
the same way, we omit the proofs. 

We say that the discrete set A is an upper-left corner region in the discrete 
rectangle T containing F if {i-\- 1, j) G A or {i,j -I- 1) G A implies (i,j) G A. The 
upper-right, lower-left and lower-right regions can be defined analogously. Let F 
denote the complement of F (in T). 

Lemma 3. F G (cg, h, v) if and only if 

F = AU BUCU D, (10) 

where A, B, C , and D are disjoint corner regions (upper-left, upper-right, lower- 
left and lower-right, respectively). 

Proof See [8]. 

Remark 1. Lemma 3 is not valid in the class (c 4 , h, v). In the class (c 4 , h, v) two 
further conditions should be satisfied: 

(i) (* - 1, J - 1) G -4 implies (i,j) ^ D, 

(ii) {i — l,j -\- 1) G B implies {i,j) ^ C. 

See Fig. 3. 




Fig. 3 •An 8-connected hv-convex set which is not 4-connected. 
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We say that F is anchored at {k,l) {1 < k,l < m) if (fc,l),(?,n) G F. 
The basic idea of the reconstruction method is to construct a 2SAT expression 
Fk,i{H, V) with the property that Fk^i{F[, V) is satisfiable if and only if there is 
an F G (cs,h,v) that is anchored at (k,l). The suitable Fk^i{H,V) in the class 
(cs, h,c) can be constructed as 

Fk,i{H, V) = Cor A Dis A Anc A LBC A UBR, (11) 

where Cor, Dis, Anc, LBC , and UBR are sets of clauses describing the proper- 
ties of ’’Corners”, ’’Disjointness”, ’’Anchors”, ’’Lower bound on column sums”, 
and ’’Upper bound on row sums”, respectively, in the following way. 



Cot — A Oij A ij' A bij 6^ j_|_i) A 

D+ij' A Cij A l\j,dij A dij 

Dis = y^ j \ for symbols X,Y G {A, B, C, D}, X j^Y}, 

id 

Anc = dk,i A bk,i A Ck,i A dk,i A A A A 

LBC = /\( Qij A dij A 

'iJ 

A /\{cvj,j A dv^j), 

i,j 3 



UBR^ A A A 

j \z<min{fc,Z} 



Ki<k 



A A (cij' A A ^i,j-\-hi) I ■ 



3 \k<i<l max{fc,/}<i J 

Remark 2. In the case of /iw-convex polyominoes one more set of clauses was 
included describing the ’’Connectivity” (more exactly, the 4-connectivity) of F 
as 



Con — l\j\^ij A bij Ci-|_i^j_i). 






(12) 



Then the reconstruction algorithm can be given as 

Algorithm 1 for reconstructing /lu-convex 8-connected sets 

Input. Two vectors, H and V . (It is supposed that H G IN™ and V G IN", 
1 — n for all i = l,...,m, 1 < Vj < to for all 

j = 1, . . . , n, and m < n.) 
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Step 1. for /c, / = 1, . . . , TO 

{ 

if Fk i{H, V) is satisfiable then output F = A\J B C U D and halt. 

} 



Output, print ”No solution”. 



Theorem 1. Fkj{H,V) is satisfiable if and only if there is an F ^ (cs,h,v) 
having projections H and V and it is anchored at (k,l). 

Proof. See [8]. 

Theorem 2. Algorithm 1 solves the reconstruction problem for 8-connected hv- 
convex sets in time 0{mn ■ min{TO^, n^}). 

Proof See [8]. 

6 Reconstruction of directed ^.-convex sets 

The class of directed /iw-convex polyominoes was studied by Del Lungo et al. in 

[ 111 - 

In this section we consider a more general class, the class of directed /i-convex 
sets, (d,h). The following reconstruction algorithm is a straight consequence of 
the definition of the class. 

Algorithm 2 for reconstructing directed /i-convex sets 

Input. Two vectors, Ft and V . (It is supposed that FI G IM™ and V € IN", 
XI i 1 — — n for all i = 1 ,..., to , 1 < vj < to for all 

j = 1, . . . ,n, and to < n.) 

Step 1. Let B = (O)mxn be the initial binary matrix representing the discrete 
set to be reconstructed. 




Fig. 4 .Reconstruction of a directed fe-convex set 
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Step 2. for j = 1, . . . , n 

{ 

if vj < E™ 1 K then halt (no solution); 

otherwise let i' is the maximal row index where E/=i ^ (if there is 
no such row then halt (no solution)) and start Vj — Efc=i ^ij number of rows 
(in rows i' , i' + 1, . . .) of I’s from the column j consecutively. 

} 



Theorem 3. If F is a directed h-convex set then it is reconstructed by Algorithm 

2 . 

Proof. It follows from the definition of directed /i-convex sets. 

It is also clear that there is at most one directed ft,-convex set with given 
projections. Finally we can remark that similar algorithm and theorem can be 
given for directed w-convex sets. 
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Abstract. Given a finite set of points sampled from a curve, we want 
to reconstruct the ordering of the points along the curve. Every ordering 
of the sample points can be defined by a polygon through these points. 
We show that for simple, regular curves Traveling Salesman Paths give 
the correct polygonal reconstruction, provided the points are sampled 
densely enough. In this case the polygonal reconstruction is part of the 
Delaunay Triangulation of the sample points. We use this observation to 
design an efficient algorithm for the reconstruction problem. 
Keywords. Curve Reconstruction, Traveling Salesman Problem, Mini- 
mum Spanning Tree 



1 Introduction 

Reconstruction of an object from an unorganized set of sample points is a fun- 
damental problem in many areas of science. Here we address the problem to 
reconstruct a curve in arbitrary dimension via a finite set of sample points. 

Following Amenta et al. [2], we state the curve reconstruction problem as 
follows 

Given a curve 7 in and a finite set of sample points S' C 7. A 
polygonal reconstruction P{S) of 7 from S is a graph that connects all 
points adjacent along 7. 

Thus an algorithm for curve reconstruction should reconstruct the order of 
the sample points along 7. That seem to be a natural claim because the main 
geometric characteristics of curves, length and total curvature, are defined via a 
limit over inscribed polygons [1]. The polygonal reconstruction is the inscribed 
polygon through the sample points. For dense samplings the length and the total 
curvature of the polygonal reconstruction approximate the length and the total 
curvature of 7. This is no longer true if one considers polygons, as in Attali [3], 
that are only near to 7 in the Hausdorff or Frechet metric. Given the sample 
points and no other information the polygonal reconstruction is in general the 
best way to estimate the length and the total curvature of 7. 

Of course there is no algorithm that can reconstruct any curve from any set 
of samples. We have to introduce restrictions on 7 and S. All known algorithms 
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demand dense sampling. That is, these algorithms guarantee to compute a poly- 
gonal reconstruction, if they do it at all, only for sufficiently dense samples. The 
most common restrictions on 7 are that 7 is a smooth, simple (closed), plane 
curve. 

The plane version of the curve reconstruction problem attracted some at- 
tention in recent years. O’Rourke et al. [10] published the first heuristic for this 
problem. Later on many algorithms and geometric graphs were designed to solve 
the reconstruction problem. But only very recently two algorithms were found 
which provably connect the sample points in the right order, provided the points 
were sampled densely enough: The algorithm of Bernadini and Bajaj [5] is based 
on a-shapes, a geometric structure introduced by Edelsbrunner [7] to model the 
shape of point sets. Amenta et al. [2] showed that the crust, a geometric graph 
defined in their article, and a /3-skeleton, another geometric graph introduced by 
Kirkpatrick and Radke [8], solve the reconstruction problem for plane curves. 

For some applications it is also interesting to reconstruct curves in higher 
dimension. An example is given by iso-line extraction in data from numerical so- 
lutions of partial differential equations. Furthermore the construction of Bezier or 
B-Spline curves through a set of sample points needs an ordering of these points. 
Such an ordering is induced by the ordering along the polygonal reconstruction. 
We show that Traveling Salesman Paths and Minimum Spanning Trees solve 
the reconstruction problem for regular curves in arbitrary dimension, provided 
the points are sampled densely enough. This might seems obvious, but there are 
quite well behaved curves for which this is not true. That is interesting from 
a theoretical point of view, because in 1930 Menger [9] suggested to base the 
geometry of curves and surfaces on minimum area triangulations. 

We have implemented our reconstruction algorithm, an algorithm for the 
computation of Minimum Spanning Trees and a factor- 2-approximation algo- 
rithm [6] for the Traveling Salesman Problem based on Minimum Spanning Trees. 
It turned out that the approximation algorithm is not suitable for the curve re- 
construction problem. These implementations can be found as a JAVA applet (a 
JAVA 1.1 capable browser is needed) at http://www.inf.ethz.ch/personal/ 
giesen/. 

This article is organized as follows: First, we give the definitions of regularity 
and samples. Second, we show that Traveling Salesman Paths and Minimum 
Spanning Trees solve the reconstruction problem. Third, we present a simple and 
efficient algorithm that also solves the reconstruction problem. This algorithm 
is better suited for practical purposes, where in general no guarantee is given 
that the points are sampled densely enough. The proof of correctness for this 
algorithm uses the same techniques as used for the Traveling Salesman result. 
Finally, we discuss how to estimate the necessary sampling density. 

2 One-Manifolds, Regularity and Samples 

We want to consider regular embeddings of compact one-manifolds in arbitrary 
dimension. The restriction on embeddings of compact one-manifolds is equiva- 
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lent to the restriction on simple (closed) curves, because every simple curve is 
homeomorphic to either an interval or the unit circle. The unit circle and the 
closed unit interval [0, 1] are the only compact one-manifolds in the sense, that 
all compact one-manifolds are homeomorphic to one of them. In the following 
we write for an one-manifold if we think of just one of and [0, 1]. 



First we want to specify our regularity assumptions. The following definition 
of regularity depends only on the image of 7 and is equivalent to the definition 
of regularity used in differential geometry. 

Definition 1. Let 7 : [0,1] — > he an embedding, T = {(ti,t2) : ti < 

G [0, 1]} and 






7(^2) - 7(^1) 
17(^2) -7(^1)!’ 



The embedding 7 is called regular at 7(to) with tangent t(7(to)) if for every 
sequence (^„) in T which converges to (to, to) in closure(T) the sequence 
converges to t(^(to))- We call^ regular if it is regular in all points ^ ft), t G [0,1]. 

This definition of regularity carries over to embeddings of S^, since such 
embeddings locally look like embeddings of [0,1]. Our definition of regularity 
has a pure metric interpretation which is fundamental for our proofs. 



Lemma 1. Let 7 he an embedding of M^, which is regular in p € 7. Let 
(Pn), {qn) and (r„) be sequences of points from 7, that converge to p, such that 
Pn < q-n < Tn for oZZ n G N in an order locally around p along 7. Then the se- 
quence of angles (a„) converges to tt, where is the angle at qn of the triangle 
with corner points Pn, qn and r„. 

Proof. Since 7 is homeomorphic the sequences (7“^(Pn)), (7~^(<Zn)) and (7“^(r„)) 
converge to 7~^(p). Thus by our definition of regularity asymptotically the three 
secants 

cony Pn, qn, convqn, rn and convp„, r„ 

have to point in the direction of the tangent t(p). That is, lim„^oo ctn = □ 

Second we want to clarify the notion of a sample S, give a notion of a local 
order of a sample and introduce a measure s{S) for the density of a sample. 

Definition 2. Let 7 he an embedding o/M^. 

1 . A sample S ofj is a finite set S = {p ^, . . . ,p"} where p^ G 7. We assume that 
the sample points p* are ordered according to the order of the T = 7“^(p*), 
i.e. T and have to he adjacent along M^. 

2 . Lf = [0, 1] we write i C j if T < P . Otherwise we choose an orientation 
along 7 and write i C j if i ^ j and 



length(7(p* : jA)) < length(7(p^ : p*)). 



where 7(p* : p^ ) C 7 zs the arc connecting p* and pi in the orientation along 
7. We write iP, j if we want to include the possibility that i = j. 
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3. To every sample the number e{S) is defined as 

e(5') = supmin{|p* — cc| : i = 0,...n}. 

xSi'i 

The notion of local order for embeddings of is well defined, because regular 
embeddings have always finite length. A proof of this can be found in the book 
of Aleksandrov and Reshetnyak [1]. 

3 Geometric Graphs and Polygonal Reconstruction 

Let S' be a finite point set in R'^. TSP(S) is a shortest path through the points 
S, MST(S) is a tree of minimal length on the points S and TST(S) is a Hamilton 
cycle of minimal length of the points S. In general these geometric graphs need 
not to be unique. 

In this section we show that in the case of regular embeddings of [0, 1] Trav- 
eling Salesman Paths (TSP) and Minimum Spanning Trees (MST) solve the 
reconstruction problem for sufficiently dense sampling. In the case of regular 
embeddings the reconstruction problem is solved by a closed Traveling Salesman 
Tour (TST). Furthermore we show that for sufficiently dense samples all edges 
of the polygonal reconstruction are edges of a Delaunay Triangulation of the 
sample points. The Delaunay Triangulation is a “nice” triangulation which can 
be computed efficiently, see for example [4] and [7]. But first of all we give an 
example which shows that the statements above are not as obvious as they might 
seem. 

Consider the following non-regular emmbedding (Figure 1) 7 c : [0,1] — *■ 
given by 

7c t ^ {bj+i - t)/( 6 j+i - bj) ■ 7c(6j) + {t - bj)/{bj+i - bj) ■ 7c(^j+i) 



7c(1) = (1,2). 

Here is 

1. 642 — Qij 'Tc(^ 4 z) — (20.2,0-^), 

^4i+l — Q-i “h 2 , 'Tc(^4i+l) — ^ 2)5 

^4i+2 ~ + 2 * jcihi+ 2 ) = (2ai+i,ai) and 

^4i+3 — Oi 3 • 2 , 7c(^4i+3) — T C^z-t-l)- 

2. Oi = 1 - 2~\ That is, [0, 1] = Ufco[“i>®i+i] {!}■ 



Note 1. Consider the embedding 7,5 one gets from a contraction by a factor 
6 G (0, 1) in the y-direction from 7 c. For every e > 0 there exists a sample S of 
75 with e(S') < e and P{S) yf TSP{S), MSTfS). 
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Proof. Choose samples Si of js such that e(S'i) < 1 — and 

Si n Js{[b4i+l,b4{i+l)]) = {ls{b4i+l),'ls{b4z+2),ls{b4i+3),ls{b4(^i+l)}- 
Replacing the edges 

convjs{b4i+i),ls{b4i+2),conv^s{b4i+2),ls{b4i+3) and convys{b4i+3),'ls{b4{2+i)) 
in P{Si) by 

conv75(64i+i), 75(641+3), conv75(64i+3),75(64i+2) and conv75(64i+2), 75(647+1)) 

leads to a shorter path through the sample points Si. That is, 

P{S,) ^ TSP{S,),MST{S,) for all i G N. 

Since e(S'i) — > 0 as z — > 00 we are done. □ 

By closing 75 in a simple manner we get an example where P{S) yf TST(S) 
for arbitrarily dense samples. 

Note 2. For every e > 0 there exists a sample S of 7c with e{Si) < e such that 
P{S) has an edge that is not a Delaunay edge. 

Proof. Choose 0 < £i < and samples Si of 7c such that e{Si) < 1 — Ui and 

S'* C7c( [ 64 ,+!, 64 ( 4 + 1 )] = {lc{b4z+l +£i),7c(&4*+2),7c(&4i+3),7c(^4(i+l) -£*)}■ 

Then conv7c(64i+2), 70(644+3) C P{Si) cannot be a Delaunay edge by the empty 
ball criterion [7] for Delaunay edges (see Figure 1). Since e(S'i) ^ 0 as z ^ 00 
we are done. □ 

Both embeddings 7c and 75 have finite length. Thus rectifiability, which im- 
plies differentiability almost everywhere, is not sufficient for the Traveling Sales- 
man path to solve the reconstruction for arbitrarily dense samples. Even worser 
in this example the polygonal reconstruction is not part of the Delaunay Tri- 
angulation of the sample points for arbitrarily dense samples. Our contribution 
is to prove that regularity is sufficient. For these proofs we need three lemmas, 
which we show at first. We always set := and := if we deal with 

embeddings of S^. 
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Lemma 2. Let 'j be a embedding o/M^ and let (Sn) be a sequence of samples 
of ^ with 

lim e{Sn) = 0 . 

n—*oo 

Then 

lim max{|p^+i - p^^\ : G S'„} = 0. 

n — »-oo 

Proof. Assume the contrary. Then we find c > 0 and G S'„, such that 

and are adjacent along and |p„ — 9 n| > 2c. By the com- 

pactness of we can assume that (p„) converges to p G 7 and ((/„) converges 
to (7 G 7 , otherwise we choose an appropriate subsequence. On 7 we find a point 
r between p and q along 7 with |r — p| > c and |r — gj > c. For every n G N 
we find r„ G S'„ with \r — r„| < e{Sn). Thus the sequence (r„) converges to r, 
which is impossible for an embedding. □ 

Lemma 3. Let j be a regular embedding of . Then there exists an e > 0 
such that 

1. Ip* — < Ip* — ifkCi — i 

2 . Ip* - p*+i| < Ip* - p'*! if kBi + i 

for all samples S = {p^, . . . ,pl'^l} with e{S) < e and all i £ |5'|}. 

Proof. Assume the contrary. Then there exists a sequence (Sn) of samples with 
lim„^oo s{Sn) = 0 and ph,Pn C 5'n such that 

kCi-l and K - p^f'^\ > K ~ Pn\ 



or 

kBi + l and |p^ - pf^^\ > \pl^ - p^| . 

By choosing a subsequence we can always assume that for all n G N one of the 
above possibilities holds. Without loss of generality assume that this is the first 
one. Since 7 is compact, we can also assume by choosing a subsequence that (pjj) 
converges to p G 7 . From Lemma 2 we get 

lim |p^ - p^l < lim |p^ - p^-i| = 0 . 

n— ^00 n— ^00 



Hence also {pff^) and (p^) converge to p. Now look at the triangle with corner 
points p^“^,pjj and p^. From the law of cosines together with |p^ — pff^\ > 
\Pn ~ Pnl> for the angle a„ at plf^ 



K-p'f?-K-Pl^-^?-\Pt^-P^n 

\pla-Pn-^\\pi-^-p^n\ 



Thus a„ has to be smaller or equal than ^ , but that is a contradiction to Lemma 

1 . □ 



Lemma 4. Let j be a regular embedding of . Then there exists an e > 0 
such that 
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1. \p^ — p™\ < \p^ — p ^ I for k C mC i, 

if k C i — I and |p* — p^\ < — p^^^\ for some I G {1, . . . , |S'|} 

2. Ip* — p™ I < Ip* — p^ I for i C m <C k, 

if kB i + 1 and |p* — p^| < |p* — p^"*"^! for some / G {1, . . . , |S'|} 

for all samples S = {p^, . . . ,pl'^l} with e{S) < e and all i £ {1, . . . , |5'|}. 

Proof. The proof is similar to the proof of Lemma 3. □ 

Now we are ready to prove the theorems for embeddings of [0, 1]. The proof 
for the Traveling Salesman Path and for the Minimum Spanning Tree are done 
at once, because we show that asymptotically the Minimum Spanning Tree of 
the sample points is a unique path, which of course is a Traveling Salesman Path. 

Theorem 1. Let ^ be a regular embedding of [0, 1]. Then there exists an s > 0 
such that MST{S) = TSP{S) = P{S) for all samples S of j with e{S) < e. In 
particular MST{S) and TSP{S) are unique, if e{S) < e. 

Proof. We are done if we can show that there exists an £ > 0 such that the 
polygonal reconstruction P{S„) is a Minimum Spanning Tree on the points S 
and that there is only one Minimum Spanning Tree for all samples S of 7([0, 1]) 
with e{S) < e. Assume the contrary. Then there exists a sequence (S'„) of samples 
Sn = {p\, ■ ■ ■ with lim„^oo £(-S'„) = 0 and MST(S'„) yf P(S'„). That 

means, P{Sn) has to contain an edge e„ = convp^,p^+^ with e„ ^ MST{Sn). 
Adding e„ to MST{Sn) induces a cycle on MST{Sn). All edges in this cycle 
different from e„ have to be of smaller or equal length than e„, because of 
the minimality property of Minimum Spanning Trees. Hence there is an edge 
e'„ = convp^,pJj yf e„ in this cycle with fc E i,i+l E / and |p^— pJil < |p^— P^^^l- 
Lemma 4 tells us that there exists TV G N with 

1- \Pn~ pH < \Pn - pH for all n>N. That means |p^ - pH < \pi ~ PtH^l for 
all n> N. 

2- \Pn-pH~^\ < \pt~pH for all n>N. That means |p^ -pl+^| < |p^ -pl+^| 
for all n > N. 

That is a contradiction to Lemma 3. □ 

Edges of an Euclidean Minimum Spanning Tree are always edges in a Delau- 
nay Triangulation [7] . Thus the edges of the polygonal reconstruction asymptot- 
ically also have to be Delaunay edges. 

The proof for embeddings of is slightly more complicated. We can use 
Minimum Spanning Trees only indirectly for the proof. 

Lemma 5. Let j be a regular embedding o/S^. Then there exists an e > 0 such 
that MST{S) C P{S) for all samples S of ^ with e{S) < e. 

Proof Assume the contrary. Then there exists a sequence (Sn) of samples S'„ = 
with s{Sn) ^ 0 as n ^ oo and MST(Sn) with MST(S'„) ^ 
P{Sn). Denote by the set of edges of P(S'„), by the set of edges of 
MST{Sn) and 



i? = - (E; n E„) and M = E„). 
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From our assumption we have M ^ Choose convp^,pJj € M with k C 1. 
Then there exists convp^,p^+^ G R with k Fi i,i + 1 I, because otherwise 
MST{Sn) has to contain a cycle, which contradicts the definition of a tree. 
Assume ~ P^n^\ < \Pn~Pn\- Removing the edge convp^,pJj from MST{Sn) 
decomposes MST{Sn) in two connected components. Both and have to 
belong to the same component, because otherwise we get a shorter Minimum 
Spanning Tree by replacing convp^,pJj by convpJ^,p5^+^ in MST{Sn). Without 
loss of generality we can assume that convp^,p^ connects the two components, 
because p^ and pjj have to belong to different components. Lemma 4 tells us 
that for sufficiently large n, we have \p^ ~ Ph\ < \Pn ~ Pn\- That means, we 
get a shorter Minimum Spanning Tree by replacing convp^ , pjj by convpj!; , in 
MST{Sn), which is a contradiction. 

Hence we can assume by choosing an appropriate subsequence that |p^ — ph\ < 
IpIijPI^^I for all n G N. Lemma 4 tells us that there exists TV G N with 

1- \Pn~ pH < \Pn - pH for all n>N. That means |p^ - pH < \pi ~ PtH^l for 
all n> N. 

2- \Pn-pH~^\ < \Pn~pH for all n>N. That means |p^ -pl+^| < \p"„-ptH^\ 
for all n > N. 

That is a contradiction to Lemma 3. □ 

Again asymptotically a Minimum Spanning Tree of the sample points has to 
be a path, which consists of Delaunay edges. By adding the edge that connects 
the start- and the endpoint of this path we get the polygonal reconstruction. We 
claim that this last edge is also an Delaunay edge. To show this we use the notions 
of Lemma 5 and the empty ball criterion [7] for Delaunay edges. The endpoints of 
MST{Sn) have to be p^ and p^+^ for some index iG {!,..., |S'„|}. Assume that 
the open balls with diameter \pH^ ~pH ^rid {p^+^,p^} C boundary(R„)nS'„ 
are not empty for all n G N. Then there has to exist p^ G with k C i or 
A: B i -I- 1 and 

\pu^^-pHHpI-pH<\pIH^-pH- 

That contradicts Lemma 3. Thus asymptotically all the balls have to be 
empty and all edges of the polygonal reconstruction have to be Delaunay edges. 

Theorem 2. Let j be a regular embedding o/S^. Then there exists an e > Q 
such that TST{S) = P{S) for all samples S of j with s{S) < e. Especially 
TST{S) is unique, if e{S) < e. 

Proof. We do the proof by contradiction. Assume that there exists a sequence 
(S'n) of samples = {p^, . . . ,plf”'} with £(S'„) — > 0 as n ^ oo and TST(S'„) yf 
P{Sn). Using Lemma 5 and choosing an appropriate subsequence we can assume 
that MST{S„) C P{Sn) for all n G N. Removing the largest edge of TST{S„) 
from TST{Sn) gives us a path through the sample points S„. This path cannot be 
a Minimum Spanning Tree, because otherwise TST(Sn) = P{Sn). Furthermore 
this edge has to be shorter than the edge e„ = convp^,p^+^ which connects the 
endpoints of MST{Sn). For all n G N we consider two cases, there exists an edge 
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= convp^,pJj in TST{Sn) with fc E i, i + 1 E / or such an edge does not exist. 
By choosing a subsequence we can assume that only one of these possibilities 
holds for all n G N. Assume that this is the first one. By construction e„ is larger 
than all edges of TS'T(S'„). In particular e„ is larger than e^, i.e. 

\Pn~P‘n\ < K-P'u^l- 

Lemma 4 tells us that there exists iV G N with 

1- \Pn~ pH < \Pn - pH for all n>N. That means - pH < \pi ~ pH~^\ for 
all n> N. 

2- \Pn-pH~^\ < \Pn~PH for all n>N. That means -K+^| < Ip’a-ptH^l 
for all n > N. 

That is a contradiction to Lemma 3. Next we consider the second case. That is, 
does not exist for all n G N. We split all TST{Sn) in two paths Pi{Sn) and 
P 2 {Sn), with endpoints and plH^- The idea is to show that both paths alone 
approximate 7 and especially the length of 7 if does not exist. We need three 
facts from differential geometry. First, regular embeddings have finite length [1]. 
Second an inequality which follows directly from the definition of length 

limsup ^length(rS'T(S'„))^ < length(7). 

And third an inequality which can be derived from a theorem of Menger [9] (here 
we omit the technical proof) 

liminf ^length(Pi(S'„)^ > length(7), z = 1,2. 

On the other hand we have 

limsup length(rS'T(S'„)) = limsup ^length(Pi(S'„)) + length(P2(<S'n))^ 

> liminf ^length(Pi(5„))^ + liminf ^length(P2(S'„))^ 

> 2 length(7). 

That is also a contradiction. □ 

4 Algorithms for Polygonal Reconstruction 

We are looking for an efficient algorithm which always computes a simple polygon 
or a collection of simple polygons through the sample points. Minimum Spanning 
Trees can be computed efficiently, but in general the Minimum Spanning Tree 
of a sample is not a polygon. We have a problem if the points are not sampled 
densely enough. On the other hand Traveling Salesman Paths and Traveling 
Salesman Tours are always simple polygons (we call a polygonal line also a 
polygon), but they are NP-hard to compute. 
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Here we describe an efficient algorithm which always computes a path or 
a tour. The output of this algorithm need not be a simple, but later we will 
discuss a variant which always computes a collection of simple polygons. We 
start with the description of the basis algorithm, because it is easier to analyze. 
The algorithm holds in every iteration of its main loop a path until all sample 
points are vertices of this path. Again 7 denotes an embedding of and S = 
{p^, . . . a sample of 7 . 

Algorithm 1. 

1. Choose an arbitrary point G S and connect it to one of its nearest neighbors 
in S — {p*}. This defines a path with two vertices and 

2. Consider the path with vertex set V = {v^,...,v^}, s.t. is connected to 

^fc+i ^ ^ j _ 1 . Calculate nearest neighbors v~ of G V and U+ 

of G V in S — V . Connect v^ to v~ if — u“| < \v^ — u+j. Otherwise 
connect to . 

3. Repeat step 2 until j = [S'!. 

4- If j is closed then connect with til'll. 

Let A(S') denote the polygon which is computed by the algorithm. Next we 
prove that for dense samples A(S') equals the polygonal reconstruction P{S) 

Theorem 3. Let j be a regular embedding o/M^. Then there exists an s > 0 
such that A(S') = P{S) for all samples S of j with s{S) < s. 

Proof. We look at the construction of A(S'). In the first step we connect an 
arbitrarily chosen p* to one of its nearest neighbors. From Lemma 3 we know 
that this neighbor has to be one of p*“^,p*“''^ if e{S) is sufficiently small. Hence 
p* is connected to p*“^ or p*+^ for sufficiently small s{S). 

In the second step we connect one of the endpoints of the polygon with vertex set 
V = . . . , v^} with one of its nearest neighbors. Let without loss of generality 

v^ = p* be this endpoint and p^^^ = v'^ G V. Assume that v~ = p^ , I ^ k — 1. 
That is |p^ — P*| < \p^ — P*~^|- Then there are the following possibilities: 

I G k — From Lemma 3 we know that this cannot happen if e{S) is 
sufficiently small. 

2. I B fc + 1. By induction we can assume that (here the indices are taken 
modulo n) 

yj = pk+j^yJ-1 = . . . , = P^ 

That means I ^ {k, . . . ,k + j} and therefore k + j Cl. We distinguish two 
cases 

(a) I = k + j + 1: We have — p^| < \p^+i+'^ — p^+i\. 

That contradicts Lemma 3 in the point for sufficiently small e{S). 

(b) / B fc + j + 1: We have |p^ — p^| < \p^+t+^ _ pk\^_ 

That contradicts Lemma 4 in the point p^ for sufficiently small £(5'). 
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That means is connected to v~ = p^~^ for sufficiently small e{S). If we repeat 
the last step until j = [S'] we get a path in which every vertex ri-l, j = 1, . . . , IS”! — 1 
is connected to the same neighbors as in P{S). This remains valid if 7 is closed 
and we connect with . □ 

Since we start the algorithm in an arbitrary sample point it is natural to ask 
whether we can start in all sample points simultaneously. This leads to a variant 
of Algorithm 1 which does not hold just one path but a collection of vertex 
distinct paths in every iteration of its main loop. Again let S = {p^, . . . be 

a sample of an embedding 7. 

Algorithm 2. 

1. Let V := . . . , he a set of trivial paths. That is, all paths in P 

consist of just one vertex. 

2. Connect two paths Pi, Pj G V if there exist two endpoints, one of each path, 
such that these endpoints are nearest neighbors* of each other. That is, the 
set P of paths is updated as follows 

V:= (v-{Pi,P,})u{nuP,} 

3. Repeat step 2 as long as connections are possible. 

4 . For every path in V connect its endpoints, if these endpoints are adjacent 
via a Delaunay edge. 

* A nearest neighbor of an endpoint p of a path is another endpoint of a different 
path which has the shortest distance to p among all such endpoints. 

This algorithm also computes the polygonal reconstruction for sufficiently 
dense sampling. From our earlier considerations we can restrict the search for 
nearest neighbors in step 2 to points adjacent via a Delaunay edge. The output 
of Algorithm 2 is then always a subset of the edges of the Delaunay Triangulation 
and its running time is bounded by the time we need to compute the Delaunay 
Triangulation, which is 0(|S'| ) resp. OdS”! log(|S'|)) if d = 2, where d is the 

dimension of the embedding space. The output need not consist of exactly one 
component, in general it is a collection of simple polygons which can be open 
or closed. Hence Algorithm 2 can serve as a basis for heuristics which solve 
the reconstruction problem for non-connected simple curves. Branching points 
remain difficult to handle. It seems that the best starting point for heuristics 
which solve the reconstruction problem with branching points is a Minimum 
Spanning Tree. 



5 Good Samples 

So far we have shown existence proofs. In this section we want to characterize 
a class of samples for which our theorems holds. We give a checkable condition 
for samples to be good. 
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Definition 3. Let 7 he an embedding o/M^, S he a sample of ^ and Br{p) the 
ball with radius r > 0 and center p. We call S a good sample, if Br{p^) n 7 is 
homeomorphic to the unit interval [0, 1] for all 

r < (1 + ? 7 )max{|p*+i -p*|, |p* for all i G {1, . . . , |S'|}, 



where rj > 0 is an arbitrarily small constant. 

The scaling with (1 + 77) prevents problems with degenerate samples, where 
two or more sample points exist which all have the same distance to one and the 
same sample point. 

Good samples fulfill the properties stated in our key Lemmas 3 and 4. In the 
proofs of our theorems we construct contradictions to these Lemmas. Thus for 
regular embeddings of [0,1] and good samples S we have P{S) = TSP{S) = 
MST{S) = Gl(S') and for regular embeddings of and good samples S we have 
P{S) =TST{S) = A{S). 

Finally we want to show that good samples exist and that samples that 
contain a good sample as a subset are good samples itself. 

Lemma 6. Let 'j be a regular embedding o/M^. Then there exists a good sample 

ofl- 



Proof Set r{p) = sup{r : Br'{p) n 7 ~ ( 0 , 1 ) for all r' < r} for all p G 7. We 
show that 

inf r{p) > 0. 

pG'f 

Then every sample S = {p^ , . . . of 7 with 



sup{|p*+i-p*| : z=l,...,\S\}< 



infp£7 r{p) 
1 + 7 



is a good sample. At first we show 



sup {r : Br>{p) C 7 ~ ( 0 , 1 ) for all r' < r} >0 for all p G 7. ( 1 ) 



Assume the contrary. Then there exists p G 7 such that for all n G N there exists 
0 < r„ < - with Br„(p) H 7 is not homeomorphic to (0, 1). Thus we can find 
PmP'n G Bi/n{p) H 7 with p C p„ C p]j (or p], C p„ C p, what can be handled in 
a similar way) and \p'„ — p| < |p„ — p|. Look at the triangle with corner points 
p,p„ and p'„. From the law of cosines together with jpj^ — p| < |p„ — p|, we find 
that the angle at p„ has to be smaller or equal than By construction (p„) and 
(p[j) converge to p, but that is a contradiction to Lemma 1. Thus inequality (1) 
is valid for all p G 7. Next assume that 



inf r(p) = 0, 

P67 

i.e. we can find a sequence (p„) in 7 with lim„^oo 'c(pn) = 0 . By choosing an 
appropriate subsequence we can assume that (p„) converges to p G 7, because 
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Ml is compact. From (1) we have r{p) > 0. We find TV S N with S Br(p){p) 
for all n > TV and 

Br{Pn) H 7 C Br(p)(p) H 7 

for all r < r{p) — |p„ — p\. Since lim„^oo \Pn — p| = 0 we have 

liminfr(p„) > > 0 . 

That is a contradiction. □ 

Lemma 7. Let 7 he an embedding ofM.^, S a sample ofj and p € 7 — S'. Then 
S U {p} is again a good sample. 

Proof. There has to exist an index iG{ 1,...,|S|} with p* C p C We have 

p G _pi I (p*) 07 , 

because otherwise S cannot be a good sample. The open balls Br{p) with radius 
r < (1 + 77 ) max{|p*+i - p\, |p -_p*|} 

are all covered by the open ball B|pt+i_pt| (p), if 77 is sufficiently small. That is, 
the sets Br{p)r\^ are open, connected subsets of i?|pi+i_p.|(p*) Gy and therefore 
are homeomorphic to ( 0 , 1 ). □ 
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Shape- from-Silhouette/Stereo and Its 
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Abstract. A novel three-dimensional reconstruction algorithm “Shape- 
from-Silhouette/Stereo (SFS^)” is presented in this paper. This algo- 
rithm combines Shape- from-Silhouette with stereo based on simple voting- 
localizing operations in a voxel space. In this algorithm, Shape-from- 
Silhouette roughly estimates the shape of the target object first, and 
then multi-eye stereo is applied within the estimated area to refine the 
shape. This algorithm overcomes the shortcomings of each algorithm and 
offers the following: 1) More precise shape reconstruction than simple 
Shape-from-Silhouette; 2) Quicker processing than simple stereo-based 
reconstruction; and 3) Better noise reduction than multi-eye stereo. Our 
experiments showed that SFS^ is highly practical for generating 3D mod- 
els of real objects. 



1 Introduction 

Recently it has become possible for ordinary personal computer (PC) users to 
handle three-dimensional (3-D) images and to create their own virtual world. In 
addition, the use of VRML (Virtual Reality Modeling Language), the standard 
language for 3-D image communication on the Internet, is spreading widely. 
On the Internet, ordinary PC users are not only receivers but also providers of 
information, and may wish to input 3-D images. State-of-the-art techniques for 
digitizing a real object fall into two categories: contact-based digitizers and non- 
contact scanners. A typical contact-based digitizer has a stylus, which is used to 
touch the target object to be measured. On the other hand, most non-contact 
scanners are based on active methods, using special ray projectors such as a 
laser [1], sheet-of-light[2,3], or structured-light. However, some of these are very 
expensive, while others require users to have skill and know-how. 

We have been developing an inexpensive, compact and skill-free 3-D scanner 
based on passive sensing methods. Many studies have been reported in this 
area and categorized with the “Shape-from-X” framework[4,5,6,7,8]. Shape-from- 
Silhouette[4,9] is one of the simplest and most powerful techniques, however, it 
can not reconstruct concave parts of the object. Stereo[6] is another powerful 
technique but the correspondence search, the main task of stereo, is very time- 
consuming. 

This paper presents a novel algorithm — Shape-from-Silhouette/Stereo (or 
SFS^) — which reconstructs the 3-D shape of objects from a series of 2-D images. 



G. Bertrand, M. Couprie, L. Perroton (Eds.): DGCI’99, LNCS 1568, pp. 177-188, 1999. 
(c) Springer- Verlag Berlin Heidelberg 1999 
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In this algorithm, Shape-from-Silhouette and a stereo technique are combined 
based on voting-localizing operations in a three-dimensional voxel space. More 
specifically, Shape-from-Silhouette is applied first to give a rough estimate of the 
object presence area, and then the stereo process refines the shape. 

The combination overcomes the shortcomings of each algorithm and the main 
advantages are as follows: 1) More precise shape reconstruction than simple 
Shape-from-Silhouette; and 2) Quicker processing than simple stereo processing. 
In addition, bounding the probable area by Shape-from-Silhouette leads to: 3) 
Better noise reduction than conventional multi-eye stereo. 

This paper is organized as follows: Section 2 briefly reviews related works. In 
Section 3, details of the algorithm are described. Section 4 overviews a prototype 
system for testing the algorithm. Experiments and preliminary evaluations are 
reported in Section 5, and Section 6 summarizes the paper. 

2 Related Works 

This section briefly describes typical related works: Shape-from-Silhouette and 
voxel-based stereo. 

2.1 Martin’s work 

Martin and Aggarwal[10] presented a volumetric description technique suitable 
for deriving 3-D object representations from two-dimensional images. The tech- 
nique is based on the idea that by occluding contours from an image sequence 
with viewpoint specifications, a bounding volume is determined that approxi- 
mates the object generating the contours. 

The technique includes two major stages: initial representation creation and 
continuing representation refinement. In the initial representation creation, two 
image planes are selected and the initial volume segments, which are a set of 
line segments with a linked-list structure, are generated. With the continuing 
representation refinement, the volume segments are projected onto the other 
image plane, which constrain the extent of the object further so that the volume 
segments can be modified to represent the object with higher fidelity. 

2.2 Zheng’s work 

Zheng and Kishino[ll] reported on an analysis of areas unexposed to contours 
and a method of detecting them. They focused on the fact that the unexposed 
area, which might be a concave or planar surface, was numerically related to the 
nonsmoothness of the contour distribution. 

To detect unexposed area, they pointed out the fact that the line of sight 
will touch two convex tangent points at the same time and then jump from the 
first mountain to the second along a particular viewing direction. Second-order 
differential filtering is applied to the trace of the contour point of the epipolar- 
plane image. The jumping points (or cusp points) are extracted as the local 
maxima in the filtered sequence. 
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2.3 Seitz’s work 

Seitz and Dyer [12] proposed a voxel-based shape reconstruction technique which 
is able to cope with large changes in visibility and its modeling of intrinsic scene 
color and texture information. They introduced the ordinal visibility constraint 
and this technique traverses a discretized 3-D space in “depth-order” to identify 
voxels that have a unique coloring that is constant across all possible interpre- 
tations of the scene. 

The main advantage of their algorithm is that it has no occluding problem, 
which afflicts most stereo algorithms. A voxel model of an object is reconstructed 
by estimating the consistency of each voxel with the likelihood ratio, taking 
occlusion between voxels into account. 

3 Shape from Silhouette/Stereo 

Shape from Silhouette/Stereo (SFS^) is an algorithm for reconstructing the 
three-dimensional shape of target objects from a series of two-dimensional im- 
ages. In this scheme, Shape-from-Silhouette[10] and stereo[6] are combined based 
on voting-localizing operations onto three-dimensional voxel space. 

SFS^ comprises two major stages: 1) Rough shape estimation with voting- 
based Shape-from-Silhouette, and 2) Precise shape reconstruction based on voxel- 
based stereo. This combination offers the following advantages: 

1. More precise shape reconstruction than simple Shape-from-Silhouette 

A naive Shape-from-Silhouette algorithm reconstructs only convex parts of 
objects. By applying a stereo technique, concave parts can be reconstructed. 

2. Quicker processing than simple stereo-based reconstruction 

Since Shape-from-silhouette provides a rough estimate of the target shape, 
the succeeding stereo process is applied only to the bounded area, or in other 
words, search areas for stereo-matching are greatly reduced. 

3. Higher noise reduction property than multi-eye stereo 

The preceding Shape-from-Silhouette limits the search area for stereo match- 
ing, The risk of wrong matching results and the resultant noise are also 
reduced. This improves the accuracy. 

An additional advantage of SFS^ is as follows. Assume a very simple 3-D digi- 
tizing configuration comprising a monoscopic camera and turntable, where the 
object is placed on the turntable and all-around views are captured by rotating 
the table stepwise. With this setup, the captured scene is not stationary; the ob- 
ject moves while the background is stable. Therefore a simple stereo algorithm 
does not work appropriately with this configuration. Since SFS^ separates the 
object area from the background in the input image, it is free from this problem 
and this leads to the following advantage: 

4. Enhanced applicability: Shape-from-Silhouette separates the object and back- 
ground. Owing to this property, SFS^ does not constrain the system config- 
uration. SFS^ is applicable to a multi-eye camera system, moving-camera 
system as well as monoscopic stationary camera system with a turntable. 
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Note that SFS^ differs from the related works mentioned earlier concerning 
the following points: 1) Voting-based Shape-from-Silhouette is embedded for 
enhancing robustness, which allows a few silhouette errors, whereas Martin’s 
work and Zheng’s work assume errorless silhouettes. 2) Voxel-based stereo is 
performed with block-matching in order to allow noises in input images although 
Seitz’s work detects color similarity in a pixelwise manner. 



3.1 Formalization 



Here, we make the following assumptions: 

1. A series of images of all-around views of a target object is input. 

2. Camera positions (or observation points) for all images are known. 

3. Internal camera parameters are known. 

4. The probable area of object presence can be defined in the 3-D space. 
Hereafter, the description is based on the following notation: 

— V : a set of voxels (3-D voxel space). 

— Vs : a set of voxels which correspond to the surface of the target object in 
the voxel space V; Vs C V (object surface). 

— P: i-th input image plane. /* and are the z-th original input image (gray- 
scale or color) and its silhouette image (B/W) respectively. Furthermore, 
and are the silhouette and background area of respectively. 

— p{v,i): the projection point of v on P. 

Original images and silhouette images give important clues and constraints 
concerning the object presence area in the voxel space. A voxel of Vs is projected 
on the silhouette part of input images (silhouette constraint). On the other hand, 
the pixels of input images on which a voxel of Vs is projected are expected to 
have the same color or intensity (color constant), assuming 1) homogeneous 
illumination environment, 2) noise-free image, and 3) Lambertian surface. 



Silhouette constraint 

for V € 14 on P as 



The total Silhouette 



condition We define the local silhouette constraint 



lifp(u,z) G Pss 
’ ^0 otherwise. 

constraint is: 

n n = 1- 

i veVa 



( 1 ) 



Color constant condition We define the observability of u on as 



„ . .s I 1 if u is projected on P without occlusion 
0 otherwise. 



Also, we define the observable image set for v as O'" = {z|Co(z;,z) = 1}. 
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The Color constant constraint is: 

= ( 2 ) 

where D(y,i) represents an evaluation of color dispersion for v. A typi- 
cal example for D(v,i) is T.uockareai'^ipiy^ *)) - v{p{v,j))Y, where 

v{p{v,j)) is the color value at p{v,j). 



The target problem is to determine the set Vg using these conditions. The two 
major stages of SFS^ determine the localized voxel set Vg from V. The Shape- 
from-Silhouette stage produces an intermediate set of voxels Vj, which gives a 
rough estimate of the object presence area based on the silhouette constraint 
condition. The voxel-based stereo further localizes Vg (C Vb) with the color 
constant condition. 



3.2 Stage 1: Shape-from-Silhouette 

As mentioned above, a silhouette of a target object gives important constraints 
of the object shape. More precisely, a point in the 3-D space projected onto the 
inside of a silhouette may be occupied by the object, whereas a point which is 
projected onto the outside of the silhouette is definitely free. The 3-D space occu- 
pied by the object can be estimated more precisely by using multiple silhouette 
images taken at various observation points. 

For clarity, we first briefly describe simple Shape-from-Silhouette, and then 
explain voting-based Shape-from-Silhouette which is embedded in SFS^. 



Simple Shape-from-Silhouette In simple Shape-from-Silhouette, the object 
presence area can be estimated in a manner similar to the process of carving a 
sculpture from a block of wood, namely, removing the parts which are apparently 
out of the object area. Assuming a voxel space, this simple algorithm reconstructs 
the three-dimensional shape of the target object with the steps shown in Fig. 1 
(a). 



Voting-based Shape-from-Silhouette The simple and conventional “Shape- 
from-Silhouette” method regards silhouettes as strong constraints. This means 
the method assumes errorless silhouettes. 

In practical applications, however, silhouettes may contain several errors. In 
this case, the silhouette errors seriously affect the shape reconstruction. If part 
of the silhouette of the target object is chipped, for example, the resultant shape 
has considerable errors. 

For the above reason, the Silhouette constraint should be relaxed for practi- 
cal applications. SFS^ adopts an extended algorithm to handle silhouettes with 
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For all voxels: — (initialization) 




For all voxels: — (initialization) 


flagiv) ON. 




score(v) 0. 


H := cp. 




Vb := cp. 


For all voxels: — (carving) 




For all voxels: — (voting) 


For all silhouettes: 




For all silhouettes: 


p 

II 

o 




p 

Ik 

o 


Then flag{v) OFF. 




Then score{v) score{v) + 1. 


For all voxels: — (localization) 




For all voxels: — (localization) 


If flag{v) = ON, 




If score{v) > Threshold, 


Then Vb := Vb U {u}. 




Then Vb := Vb U {u}. 



(a) (b) 



Fig. 1. Algorithm for simple Shape-from-Silhouette (a) and voting-based Shape- from- 
Silhouette(b). 




Fig. 2. Voting on the voxel space: the density shows the score of each voxel. 



errors for wider applicability, where the silhouettes are treated as weak con- 
straints. In the extended algorithm, the shape estimation is performed with a 
voting- localizing scheme as shown in Fig. 1 (b). 

This extended approach has a great advantage over the simple method; 
silhouette images including several errors do not seriously affect the object 
shape obtained. Moreover, the voting-localizing scheme gives a general frame- 
work where the conventional approaches can be treated as a special case of 
voting-based Shape-from-Silhouette: only full-marked voxels are judged to be in 
the object area. 



3.3 Stage 2: Voxel-based Stereo 

If the target object contains a concave part, Shape-from-Silhouette does not re- 
construct the 3-D shape correctly. Incorporating a stereo technique, SFS^ detects 
and recovers the concavity as long as the object surface has sufficient texture 
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information. This stage consists of the following substages: 1) partial surface 
estimation, and 2) total model generation. 



Partial surface estimation The basic idea of this stage is as follows: if a voxel 
V corresponds to the surface of the object, the color of its projection point on 
each image plane is similar to each other. Based on this idea, errors of these 
colors are accumulated on each voxel first (error voting). Then, voxels with the 
minimum accumulated error are chosen. This set of voxels can be determined for 
each observation point and is regarded as the partial surfaces for the observation 
point. Now we refer to the partial surfaces for the i-th observation point as Vgp. 

We assume that the observable image set includes at least n consecutive 
members for any v G V^p for any observation point i. This means that the 
occlusion problem can be avoided for n consecutive input images for any voxels 
of any partial surfaces. 

Taking lighting fluctuation and image noises into account, this assumption 
allows SFS^ to relax the color constant condition (Eq. 2) as follows: 

EE D{v, i) min, (3) 

veVs ieO^ 

where is n consecutive members of O". 

Here, let E,,\, be a set of voxels along a ray from the i-th observation point 
through V. For the i-th observation point, the partial surface V^p is identified 
with the procedure shown in Fig. 3 (a). The steps are repeatedly applied to all 
observation points. Eventually as many partial surface sets are obtained as the 
original images. 



For all voxels r; G Vf>: — (initialization.) 




For all voxels n G Vf>: — (initialization) 


score{v) 0. 




score{v) 0. 


II 




K. := 0- 

For all voxels v ^ Vb'. '■ — (voting) 


For all voxels n G Vj>: — (voting) 




For all observation points: 


score{v) score{v) + D{v, i). 




If L{v,i) > L{v',i), [v G V^,v' G n 

Then score{v) := score(v) + 1. 


For all voxels v ^ Vi,: — (localization) 




For all voxels v ^ Vb'. — (localization) 


If score{v) — MINyi{score{v)), 




If score{v) > Threshold, 


Then := V:„ U {«}. 




Then Vc := Vc U {v}. 



(a) (b) 



Fig. 3. Algorithm for partial surface estimation (a) and total model generation (b) . 
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Fig. 4. (a) Partial surface estimation (partial surface localization on a ray); the voxel 
with the minimum score on a ray is chosen as the partial surface, (b) Total model 
generation (voting step at the i-th observation point): only voxels behind the partial 
surface are voted. 



Total model generation This substage generates a total 3-D model from 
partial surfaces. Although a partial surface is calculated at each observation 
point, each may have errors. Integrating the partial surfaces with the voting- 
localizing scheme, the errors are rounded to obtain the total model of the object 
with higher accuracy. 

Let L(v, i) be the Euclidean distance between v and the i-th observation 
point. Vc is a set of voxels which gives the volumetric representation of the 
object. The integration process is shown in Fig. 3 (b). After the process, Vg is 
easily obtained by detecting the boundary of Vc- This procedure does not provide 
the exact solution which satisfies the relaxed color constant condition but gives 
a good approximation. 

4 Application of SFS^ to 3-D Digitizer 

We have built a 3-D digitizer to test the applicability of the SFS^ algorithm to 
3-D modeling. The digitizer comprises a turntable, a monocular camera and a 
personal computer (Fig. 5). This digitizer is capable of automatically generating 
the 3-D texture-mapped model of the target object. 

The 3-D digitizing process consists of five major steps: 1) calibration, 2) 
image capturing, 3) silhouette extraction, 4) shape modeling, and 5) texture ac- 
quisition. SFS^ is embedded in the shape modeling process. The other processes 
are briefly shown below^. 

4.1 Calibration 

In this process, camera position relative to the turntable is automatically cal- 
culated using an object of known shape placed on the turntable. By rotating 
the turntable, the object is captured from various observation points (original 



^ For the details, refer to [9,13]. 
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Fig. 5. System overview 



images). In the current implementation, a calibration panel with circle patterns 
is used. After detecting each circle position, the Hough transform[13] is applied 
to calculate camera position for enhancing robustness. 

4.2 Image capturing 

In this step, a series of images is captured to cover all views of the object. In 
addition, the background image is captured as well. The background image is 
almost the same as the original images except that it excludes the target object, 
and is used for extracting silhouettes. 

4.3 Silhouette extraction 

A silhouette image corresponding to each object image is generated based on 
subtraction operations. In this step, naive silhouette extraction, or pixel-level 
extraction, is performed first. Then, supplemental silhouette extraction, i.e., 
region-level extraction, is applied, which improves the accuracy of the silhou- 
ette obtained. In the region-level extraction, the average of absolute subtraction 
at each pixel in a region is calculated to judge if the region corresponds to the 
object or not. 

4.4 Shape modeling 

The shape modeling step consists of two processes: volumetric modeling and 
surface representation conversion. SFS^ is applied to volumetric modeling. In 
the surface representation conversion, many initial polygons are generated first 
by connecting adjacent surface polygons. Then, a polygon reduction operation 
merges polygons to reduce the total number of polygons. 

4.5 Texture acquisition 

The texture of a patch can be derived from multiple original images because a 
patch can be seen from various viewpoints. Therefore, an original image should 
be identified for each polygon to derive its texture. 
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In order to determine the source image appropriately, the following points 
should be considered. 1) A larger amount of texture information in the source 
image leads to a better derivation of texture. 2) Smoothness of texture on the 
boundary of patches is required for concealing the patch edges. To meet both 
requirements, we solve the texture acquisition problem based on an energy min- 
imization method [9]. 



5 Experiments and Preliminary Evaluations 

We tested the SFS^ algorithm with ceramic toys (a miniature castle and squir- 
rel). In the experiments, the step of table rotation was ten degrees so 36 orig- 
inal images were taken. The image size was 640x480. The resolution of the 
voxel space was 320 x 320 x 240. The color constant condition was evaluated by 
block-matching with a 15x15 mask. The number of consecutive member was 
assumed to be five. 

Fig. 6 and Fig. 7 show examples of original images of each object, wireframe 
3-D models obtained and resultant texture-mapped 3-D images. The modeling 
results show that the SFS^ reconstructs the shape with practical accuracy. 

The computational time of each stage is shown in Table 1. The data were 
measured on Dell’s Optiplex GX1-400MT personal computer (CPU: Pentium II 
400MHz, Memory: 128MB). The measurements show that computation time for 
voting-stereo is still dominant. However, we consider that a further speed increase 
will be achieved with a multi-resolution technique, where stereo processing with 
lower resolution bounds the area of voxels for stereo processing with higher 
resolution. 




Fig. 6. Examples of original images and modeling results (castle) 
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Fig. 7 . Examples of original images and modeling results (squirrel) 



Table 1. Analysis of processing time 



Step 


1 Processing time (sec) I 


Castle 


Squirrel 


Silhouette extraction 


53 


31 


Shape modeling 


Shape-from-Silhouette 


93 


36 


Voting-based stereo 


1,230 


1,870 


Surface representation 


106 


38 


Texture acquisition 


43 


43 



6 Summary 

A novel three-dimensional reconstruction algorithm “Shape-from-Silhouette/ 
Stereo (SFS^)” has been presented in this paper. This algorithm combines the 
Shape-from-Silhouette with Shape-from-Stereo based on a voting-localizing 
scheme with the voxel space. 

The main advantages of this algorithm are as follows: 1) Quicker processing 
than a simple stereo-based reconstruction, 2) More precise shape reconstruction 
than a simple Shape-from-Silhouette, and 3) Better noise reduction than multi- 
eye stereo. An additional advantage is that this algorithm enables the stereo 
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technique to be applied to the digitizer based on the monoscopic camera system 
with a turntable, where the object moves against the stationary background. 

We tested this algorithm with toys to evaluate the applicability. The exper- 
iment showed that SFS^ is highly practical for generating 3-D models of real 
objects. 

One of the current problems of the algorithm is that sometimes the sur- 
face of a thin part may be missing due to errors in partial surfaces. This error 
can be detected by checking consistency between two silhouettes: one generated 
from the final model and the other from the probable object area estimated by 
Shape- from-Silhouette. After detecting the inconsistency, adaptive thresholds for 
localizing the final model should be applied. Computational time should be also 
reduced. This will be achieved by applying a multi-resolution technique. 
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Abstract. Connectivity is a topological notion for sets, often introduced 
by means of arcs. Classically, discrete geometry transposes to digital sets 
this arcwise appoach. An alternative, and non topological, axiomatics has 
been proposed by Serra. It lies on the idea that the union of connected 
components that intersect is still connected. Such an axiomatics enlarges 
the range of possible connections, and includes clusters of particles. 

The main output of this approach concerns filters. Very powerful new 
ones have been designed (levelings), and more classical ones have been 
provided with new properties (openings, strong alternated filters) 

The paper presents an overview of set connection and illustrates it by fil- 
terings on gray tone images. It is emphazised that all notions introduced 
here apply equally to both discrete and continuous spaces. 



1 The connectivity concepts 

1.1 Classical connectivity and image analysis 

In mathematics, the concept of connectivity is formalized in the framework of 
topological spaces and is introduced in two different ways. First, a set is said to be 
connected when one cannot partition it into two non empty closed (or open) sets. 
This definition makes precise the intuitive idea that [0, 1] U [2, 3] consists of two 
pieces, while [0, 1] consists of only one. But this first approach, extremely general, 
does not derive any advantage from the possible regularity of some spaces, such 
as the Euclidean ones. In such cases, the notion of arcwise connectivity turns 
out to be more convenient. According to it, a set A is connected when, for 
every a,b € A, there exists a continuous mapping xp from [0, 1] into A such that 
xp{0) = a and xp^l) = b. Arcwise connectivity is more restrictive than the general 
one ; however, in any open set which is connected in the general sense is also 
arcwise connected. 

A basic result governs the meaning of connectivity ; namely, the union of 
connected sets whose intersection is not empty is still connected : 

{Ai connected} and {nAj 0} {UA^ connected} (1) 

In discrete geometry, the digital connectivities transpose the arcwise corre- 
sponding notion of the Euclidean case, by introducing some elementary arcs 
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between neighboring pixels. This results in the classical 4- and 8-square connec- 
tivities, as well as the hexagonal one, or the cuboctahedric one in 3-D space. 
Is such a metric approach to connectivity adapted to image analysis ? We can 
argue that 

a/ Certain arcwise connections seem smowhat shaky, e.g. when they do not 
treat equally a set and its complement; 

b/ In discrete motion analysis, the trajectories of fast moving objects of- 
ten appear as dotted tubes, and arcwise connections are unable to handle such 
situations; 

c/ more deeply, one can wonder what is actually needed in image processing. 
As a matter of fact, when we examine the requirements for connectivity, we 
observe that the basic operation they involve consists, given a set A and a point 
a; G A, in extracting the particle of A at point x. For such a goal, an arcwise 
approach is obviously sufhcent. But is it necessary? 



1.2 The notion of a connection 

These criticisms led G. Matheron and J. Serra to propose a new approach, in 
1988 [ SER88] where they take not (1) as a consequence, but as a starting point. 
However, their definition is rather general and stated as follows. 

Definition 1. (G. Matheron and J. Serra) Let E he an arbitrary space. We call 
connected class or connection C any family in V{E) such that 

(i) 0 G C and for all x G E, {a;} S C 

(ii) for each family {Ci} in C, C\ Ci ^ 0 implies UCi G C. 

As we can see, the topological background has been deliberately thrown out. 
The classical notions (e.g. connectivity based on digital or Euclidean arcs) are 
indeed particular cases, but the emphasis is put on another aspect, that answers 
the above criticism c/ in the following manner ([ SER88], Chap. 2) : 

Theorem 2. The datum of a connection C on V{E) is equivalent to the family 
{'^x , X G E} of openings such that 

(Hi) for all x G E, we have jx(x) = {x} 

(iv) for all ACE, x,y G E, ^x{A) and 7 j/(A) are equal or disjoint 

(v) for all ACE, and all x G E, we have x ^ A lx{A) = 0. 

An alternative, but equivalent, axiomatics has been proposed by Ch. Ronse 
[RON98] ; it contains, as a particular case, another one by R.M. Haralick and 
L.G. Shapiro [ HAR92] ; however, both approaches are still set-oriented. The ex- 
tension from sets to the general framework of complete lattices and in particular 
to numerical functions has been proposed by J. Serra [ SER98a] 

Historically speaking, the number of applications or of theoretical develop- 
ments which was suggested (and permitted) by this theorem is considerable (see, 
among others [MAR94][MEY94][SAL96]). It has opened the way to an object- 
oriented approach for segmentation, compression and understanding of still and 
moving images. 
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2 Examples of connections on V{E) 

Several instructive examples of connections on V{E) can be found in [ HEI97], 
in [RON98] and in [ SER98a]. Here we just recall a few of them, which are of 
interest for the present study. 

i/ All arcwise connectivities on digital spaces are connections in the sense of 
definition 1; 

ii/ In [ SER88] ch.2, J. Serra provides E with a first connection C and con- 
siders an extensive dilation S : V{E) 'P{E) that preserves C {i.e. S (C) C C). 
Then the inverse image C' = (C) of C under S defines a new connection on 

V{E), which is richer. The C -components of 5 {A ) , A G V{E), are exactly the 
images 5 {¥() of the C'-components of A. If stands for the connected opening 
associated with connection C and for that associated with C , we have 

Vx (A) = 7 a;(j (A) n A when x G A ; (A) = 0 when not (2) 

(similar technique applies also when stands for an opening, but wihout the 
statement on the connected components, and without Eq.2 [SER98b]) 

In practice, the openings characterize the clusters of objects from a given 
distance d apart. Figure I illustrates this point by ’’reconnecting” dotted lines 
trajectories. But a contrario, such an approach can also provide a means to 
extract the objects which are isolated. They will be defined by the fact that for 
them (A) = (A), an equality which yields easy implementation [SER98b]. 

Hi/ In [RON98], Ch. Ronse starts also from a first connection on V{E), 
and proposes, as a new connection, the class generated by the points and the 
connected sets opened by a given structuring element B. li x G X o B, then 
jx (A) is the initial connected component of Xo B containing x, and when point 
X € X\X o B , then (A) = {x}. For example, for such an ’’open” connection 
by a 3x3x square, the set of fig. 2a has 16 particles: the two surrounded squares, 
plus 14 isolated points. Also, the six points of the vertical gulf are isolated pores. 




a) First image of a sequence b) Space Time display of the ball 



Fig. 1. In a dilation based connection, the three clusters in grey are considered as 
particles. They correspond to the slow motions. 
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a) b) c) 



Fig. 2. a) initial set A. For the ’’open” connection by the 3x3 square, A is made of 16 
grains, namely the two 3*3 squares plus 14 isolated points ; b) (Pm<= 7 m(A) for M equal 
to the six pores of the central gulf (surrounded); c) 7 m (A) for the same marker. The 

difference comes from that the ’’open” connection is not adjacency preventing. 



2.1 Connected Filters 

For now on E is an arbitrary set, and V{E) is supposed to be equipped with con- 
nection C. For every set A G V{E), the two families of the connected components 
of A (the ’’grains”) and of A° (the ’’pores”) partition space E. Then, an oper- 
ation Ip : V{E) — > V{E) is said to be connected when the partition associated 
with 'il){A) is coarser that that of A \ SER93]. Clearly, taking the complement 
of a set, or removing some grains, or filling pores generate connected operators. 
The major class of mappings we have in view is that of the (connected or not) 
morphological filters. Let us briefly recall it 

- A mapping ip is said to be a morphological filter on V{E) when it 
is increasing and idempotent: 

A,BCE, Ac B ^ ipiA) C ip{B) increasingness 

ip{ip{A)) = ip{A) idempotence 

- In particular, a filter that is extensive (resp. anti-extensive) is 
called a closing (resp. an opening) : 

7 an opening : 7 = a filter and 7(A) C A , ACE 

Lp a closing : tp = & filter and p{A) DA , ACE 

- A granulometry is a family {7^ , d > 0} of decreasing openings 
(i.e. d > d' ^ jd Pk Id') and an anti-granulometry is a family {pd , d > 0} of 
increasing closings, both depending on a positive parameter. Every granulometry 
and every anti-granulometry satisfy the following law of a semi-group, where the 
more severe operation imposes its conditions 

p>n^ ipnipp = ippipn = tpp- (3) 

- The products (^7 and jp of two arbitrary opening 7 and closing 
p turn out to be idempotent. They are called alternated filters. Note that the 
product 77' of two openings (or pp' of two closings) is a priori not idempotent. 
Denoting hy Id the identity operator, we have the following equivalences [MAT88] 

ip{Idr\ip) and ip = yp ■0(Jd U V’) (4) 



Ip = pj 
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- When a morphological filter ij} satisfies both conditions (4), which 
provide it with robustness, i.e. when 

A,B C E, i;{A)nAC-ilj{B) C^I;{A)UA ^{B) = ^{A) 

one says that ip is strong. In particular, openings and closings are strong . 

- The notion of Alternated Sequential Filters plays, with the alter- 
nated filters, the same role as the granulometries do with 7 and (p . Let {7^} 
and {p>i}, i G Z"*", be a granulometry and an antigranulometry. The composition 
products 

Pn = and En=^nPn--liVi--llPl. 

are idempotent and called Alternated Sequential Filters (A.S.F).They do not 
satisfy implication (3) , but only the following absorption law 

p>n^ ippi>n = i’p 

2.2 Set opening by reconstruction and some derivatives 

A comprehensive class of connected filters derives from the classical opening 
by reconstruction. Its definition appears in [ SER88], ch.7.8. Significant studies 
which use this notion may be found in literature, such as [ SER93] (connected 
operators), [CRE97], (stable operators) [MEY94], (spanning trees), [ HEI97], (grain 
oprators) . 

An opening by reconstruction is obtained by starting from an increasing 
binary criterion t (e.g. '’’the area of A is > 10^^), to which one associates the 
trivial opening 

7^ (A) = A when A satisfies the criterion 
7^ (A) = 0 when not 

The corresponding opening by reconstruction 7 is then generated by applying 
the criterion to all grains of A, independently of one another, and by taking the 
union of the results : 



7(A) = U {7-7,,(A), X G E} 

The closing by reconstruction p (for the same criterion) is the dual of 7 for 
the complement, i.e. if - stands for the complement operator, then 

V? = -7- . 

For example, in if we take for criterion r, ” have an area > 10”, then 
7(A) is given by the union of grains of A whose areas are > 10, and (p{A) is 
the union of A and all its pores whose areas are <10. Similarly, if criterion r is 
expressed by ’’hit a fixed marker M”, then 7(A) is the union of the grains that 
hit A, wheras p{A) is composed of A and of all pores that miss M . 

The operators by reconstruction satisfy number of nice properties. The three 
following ones are typical examples of them. 
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Fig. 3. An example of a pyramid of connected alternated seqnential filters. Each 
contour is preserved or suppressed, but never deformed : the initial partition increases 
under the successive filters, which are strong and form a semi-group. 



Proposition 3. [ SER93]Let 7 he an opening by reconstruction, and p he a 
closing that does not create connected components, i.e. such that 

X G (p{A) A n jx'piA) ^ 0 (5) 

Then the associted alternated filters are ordered, and we have > 7)7 

Proof. Consider tp'y{A), for ACE. Since the (extensive) closing ip does not 
create new connected components, it can only enlarge those of 7(GI); now 7 acts 
grain by grain, hence 7V?7 = PJ. According to criterion 6.6 in [ MAT88c]this 
equality is equivalent to 'jp > p^. 



The most common closings may not satisfy condition (5). It is the case for 
intersections of closings by segments, for example. However, if starting from 
an arbitrary closing p, we restrict p{A) to its grains that contain at least one 
point X & A, the resulting operation is still a closing. It is the reason for which 
condition (5) is always assumed implicity in practice. 

Corollary 4. Let { 7 ^} he a granulometry by reconstruction, and {pi} be a anti- 
granulometry that does not create connected components, then the A.S.F. = 
^nPn---liPi ■■■liPi Satisfy the semi-group relation (3) 

{easy proof). This corollary explains, partly at least, why the A.S.F. by re- 
construction are so often involved in pyramids, for coding, segmentation, or 
indexation purposes. In such pyramids, the additional information to get finer 
levels is concentrated in subdivisions the flat zones [MEY94]. An example of such 
a behaviour is presented in Fig. 3. Each cross section of the gray tone image has 
processed by an alternating sequential Alter by reconstruction. The underlying 
binary criterion was here associated with the size of the disc inscribable in each 
grain. 

Another point of interest is the following. The inflmum of openings is gen- 
erally not idempotent. But consider a family {ji,i G /} of openings by recon- 
struction associated with criteria {rj}. Clearly, their inflmum 7 = 07^ is still an 
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opening, where each grain of A must fullfill all criteria Tj to be retained. On the 
other hand, U 7 i is the opening by reconstruction where each grain must satisfy 
one criterion at least. However, the largest opening is here the identity map- 
ping, and not the largest increasing operator (z.e. A E,\/A G V{E)). Hence 
we may state: 

Proposition 5. In the lattice of the increasing operators from V{E) into itself, 
the openings and the closings by reconstruction constitute two complete quasi 
sub-lattices. 



2.3 Adjacency 

The central notion of adjacency [SER98b], which governs the structure of the 
levelings below, is defined as follows 

Definition 6. Let C be a connection on V{E), and let X,Y G C. Sets X and Y 
are said to be adjacent when X\JY is connected, whereas X and Y are disjoint. 



Definition 7. Given a connected component A G C and a set M G V{E), one 
says that A touches M, and one writes A || M when either AC\ M ^ 0, or the 
grain of AU M which contains A is strictly larger than A. By duality, one says 
that A lies in M when A does not touch M‘^; one writes Aj M. 

The duality under complement provides the two following equivalences 

A II M ^ A ” and A , M ^ A j (6) 

Note that relation A || M (A touches M) is less demanding than AnM yf 0 
(A hits M), since it accepts in addition that A and M be adjacent. Similarly, 
A j M (A lies in M), is more severe than ACM, since none of the grains of A 
and of M must be adjacent to each other. 

When 7 a;(A) yf 7 y(A) for an arbitrary A G V{E) one cannot have ^x{A) || 
7 j,(A) since 7 a, (A) is the largest element of C included in A. But 7 a, (A) may not 
touch some pores Yi of A and, nevertheless, touch their union Ul^ For example, 
for the ’’open” connection Hi/ of section 2, none of the six point pores of the 
central gulf, in fig. 2a, is adjacent to the set, whereas their union touches it. 
The most powerfull coonections are those which prevent this perverse effect, i.e. 
which fullfill the following condition 

Condition 8 A connection C on V{E) is adjacency preventing when for all 
X G E and all sets A,Mg V{E)y G M and 7 a;(A) , [ 7 y(A) U ')y{A/] 
lx{A) , U{[7y(A) U Cy{A/]y G M} 

In particular, adjacency prevention governs the strenght of the filters by 
reconstruction, as proved in proposition 9 and in theorem 12 . 




198 



Jean Serra 



Proposition 9. Let C he a eonnection on V{E) such that if for all i G I we 
have A , Bi, then A , UBi. If 7 and ip stand for an opening and a closing by 
reconstruction based on connection C , then both alternated filters 773 and ipj are 
strong. 

Proof. We shall prove the proposition for pj. Because of Eq.(4), we have only 
to show that for all A G P{E), if a; G if is an arbitrary point, then x ^ 
implies x ^ B = p^[A U p^{A)] . Suppose first that x ^ A. Opening 7 can only 
enlarge pore ^x{A‘^), and closing p keep it unchanged (if not, we would not have 
x ^ p^{A)). Hence 7x(^°) is equal to ^x[A^p^{A)Y , and finally x ^ B. Suppose 
now that x G A. The grain 'yx(A) touches none of the grains and the pores of 
A that compose p'^{A) (if not, 7 ^(H) would belong to p"f{A), now x ^ p"f{A)). 
Then, according to the assumption of the proposition, ^x{A) does not touch 
p^{A), neither [A U t) 7 (H)]\ 7x(^), hence 7x(^) = 7x[^ U p^{A)] and finally 
x ^ B, which achieves the proof. 

According to Equivalences (4), the proposition implies that p^ admits a 
decomposition as p' , but for a 7' and a p' priori different from 7 and p. We 
will now see under which condition these primitives can be the same. 

2.4 Set Levelings 

Levelings have been introduced by F. Meyer, in [ MEY98b], as gray tone con- 
nected operators on digital spaces, for the usual digital arcwise connections based 
on neighbor pixels in square or hexagonal grids. In [ MAT97], G. Matheron pro- 
poses a generalization to an arbitrary space (hence, without a priori connec- 
tion). Here, connection arrives as a final result, and is generated by an extensive 
dilation. Now in both cases, levelings turn out to be flat operators, i.e. that 
treat each grey level independently of the others. This circumstance suggests 
to try and generalize F. Meyer’s approach by focusing on set levelings, but re- 
interpreted in the framework of an arbitrary connection C. J. Serra entered this 
way of thinking [SER98b], which allowed him to obtain theorem 12 

Independently of these approaches, H.Heijmans has introduced and studied 
the class of ’’grain operators” in[ HEI97]. Levelings, in the sense of definition 10 
below, are particular grain operators. However, the ’’good” properties of these 
grain operators appear when they derive from markers based openings and clos- 
ings. So we will restrict ourselves to such criteria (for example, we will not accept 
or reject a particle according to its area). 

From now on, we denote by 7 m (A) the union of all grains of set A that touch 
an arbitrary set M, called marker: 

7m(A) = U {7x(A) , x G E , 7x(A) II M} 

Similarly, the complement of closing pf^c(A) is the union of those pores of A 
that hit marker N‘^, 

[pnYA)Y = U{7x(A^) ,xGE, Jx(AY II N-}; (7) 
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hence 



n mc(A) = UMA^) ,x€E, j^(A-) j N} (8) 

is the union of those pores of A lying in marker JV. 

Definition 10. Let E he an arbitrary set, and C he a eonnection on V{E). Let 
7m and (fiffc an opening and a closing, both by marker reconstruction, from V{E) 
into itself. The leveling A : V{E) — > V{E), of primitives 7m and lpn<= is then 
defined by the relation 

A = 7 m U (- n (/?Arc) = n (- U 7 m) (9) 

where - stands for the complement operation on V{E). 

When applied to set A, leveling A yields the two equalities 

A n A(^) = An nM{Ai) , , 

A^ n A(^) = A" n <pn<=(A) A U A(A) = ^ U (Pn4A)) ^ ^ 

so that A(A) acts inside A as opening 7 m, and inside as closing (pjv^- System 
(10) also relates to the activity lattice, where a mapping if on V{E) is said 
to be less active than another, if' , when if' {A) modifies more points of A than 
if {A) does, yA e V{E), (ch.8 in [ SER88]). If Ld stands for the identity operator, 
the activity ordering is as follows 

Iddif A Lddif' 

IdU if C IdU if' 

and one notes if < if A complete lattice is associated with this ordering, where 
the supremum and the infimum of a family {ifi , i G L} are given by 

gif^ = [- n (Liifi)] u [rnfi] 
iifi = [Ld n (u^/>i)] u [nifi] . 

When applying this system to the family {7 m, TN‘} of the two leveling prim- 
itives, we draw from (9) that 

j g(p = X j f (p = Ld . 

Conversly, the relation jgip = A yields equation (9), hence may be considered 
as an alternative definition for leveling. 

An operation whose definition involves the complement - risks not to be 
increasing. But in the present case, we will now see that the condition under 
which A is increasing makes it also a strong filter, which means much more. 

Lemma 11. Let A,N G V{E), and let Y be a pore of A. LfY lies in N, then 
all grains of A which are adjacent to Y hit N . By duality, if a grain X of A 
does not touch set N, then none of the pores of A adjacent to X is included in 
N . 




200 



Jean Serra 



Proof. Consider a pore Y of A, with Y j N, and a grain X of A adjacent to Y. 
Since there exists a point x € X that is adjacent to Y, and since Y , iV° ( Eq 
(6)), X belongs necessarily to iV; hence X C\ N 

Take now a grain X of A that does not touch N, i.e. such that X j N‘^. We 
draw from the first part of the proof that every pore Y of A that is adjacent to 
X meets hence is not included in N. 

Theorem 12. Let C be an adjacency preventing connection on V{E). Given 
M,N C E with N C M, the leveling \m,n '■ 'P(E) P{E) of primitives 7m 
and ifiNc is a strong connected filter, and admits the double decomposition 

A = 7m = Pn<^1m ■ 

Proof. We have to prove that the three following oprations are identical: 

i/ to take the union of the pores of A lying in N and of the grains of A 
touching M; 

a/ to take the union A’of the grains of A touching M, and to add it to the 
pores of A’ that lie in N; 

Hi/ to add to A all its pores lying in N, and to extract from the result the 
union of all grains touching M. 

Indeed, when N C M, the lemma states that all grains of A adjacent to a 
pore Y i N hit N, hence hit also M. On the other hand, a grain 7 x(A) of A which 
is not adjacent to various grains X and pores Y of A, with Y j N , are neither 
adjacent to the union of these X and Y (assumption of adjacency prevention), 
so that the two processings i/ and Hi/ are identical. The proof is achieved by 
observing that i/ is a self-dual procedure, and that ii/ and Hi/ are dual of each 
other. 



Remark that, when N C M, the supremum of the two logical conditions A , 
M and A j is the certainty. Then, according to proposition 8.5 in [ HEI97], we 
find again the increasingness of A. For extending levelings from sets to numerical 
functions, we need to consider them as functions of their three arguments A, M 
and N. Now, is the mapping A(A, M, N) from \P {E)]^ into V (E) increasing ? 

Corollary 13. The leveling A : \P (E)]^ V {E) is increasing if and only if 
the two operands M and N are ordered by N C M 

Proof. We draw from the theorem that, given M and N, with N C M, 

ACA'^ A(A, M, N) C A(A', M, N). 

On the other hand, given A' , when M C M' and N C N' more grains of A' are 
touched and more pores of A' are lying, hence 

\{A',M,N) C \{A',M',N') 

which achieves the proof (the only if part is given by the counter example of 
fig.la 
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M=N= 



a) b) 

Fig. 4. a)Non increasingness of A when N* M. Take for A’ grain A plus its pore; then 
Ac A' whereas A(A') C A(A). b) Take the internal pore of grain A as M and N, then 
A(A)equals the pore without the grain (flip-flop effect) 



An interesting feature of levelings concerns their possible self-duality. Firstly, 
we may consider the behaviour, under complement, of the triple mapping 
(A, M, N) A(A, M, N). We have 

= [7Mc(A")]"n [An [7 jv(A)]"]" = n [A= U7 tv(A)] , 



hence [A(A^ = jn{A) U [A" n <pm'=(A)] = A(A, IV, M) 

Therefore self-duality of A(A, M, N) is reached when and only when the two 
markers N and M are identical (a result that can also be drawn from proposition 
8.3 in [ HEI97]). Since, in addition, condition M = N implies the inreasingness 
of A, we may state 

Proposition 14. The leveling (A, M, M) A(A, M, M) is an increasing self- 
dual mapping from V{E) x V{E) into V{E). 

In this approach, we implicitely supposed that the data of A and of M are 
independent. In practice, it often occurs that marker M derives from a previous 
tranformation of A itself, M = fi{A), say. Then the proposition shows that the 
leveling A : V{E) V{E) , with A = A (A, p,{A),p{A)) is self-dual if and only if 
mapping p itself is already self-dual. 

We conclude this section by exhibiting two examples showing how necessary 
are some assumptions above. Take for A a single grain with an internal pore, 
and for M = N the set made by the pore of A in fig.4b. Suppose we replace, in 
definition 10, the condition 7 x(A) || M by 7 x(A) n M = 0, and 7 a;(A'=) j Af by 
7 a; (A'^) C N Clearly, we have 

(A) = A U M (A) = A U M, 

but 7 m(A) = 0 ^ Tm<^ 1 m{a) = 0 
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whereas A(^) = M is neither nor ipM<="fM{A). Moreover, the 

example shows that A(2lnA(A)) = 0 and that A(^UA(2l)) = AUM; this implies 
that A cannot be decomposed into the product of an opening by a closing or vice 
versa (theorem 6-11, corollary 2 in[ MAT88c]). Notice also, finally, that in the 
example of fig. 4b the border between the grain and its internal pore is preserved, 
but not the sense of variation. As a matter of fact, such a ’’flip-flop” effect is due 
to the case when M contains a pore of A, but misses the surrounding grain(s). 
It cannot appear in the actual levelings of definition 10. 

The second counter-example concerns adjacency prevention. Let us adopt 
the ’’open ” connection, and take for A the set fig. 2a, and for marker M = N 
the six point pores of the central gulf. Fig. 2b and fig. 2c show the two transforms 
(Pm<=7m(AI) and 7 m <Fm‘=(AI) which are obviously different : one cannot drop the 
adjacency prevention, in theorem 12 ! 

2.5 Levelings as function of their markers 

For the sake of simplicity, we shall take M = N through this section, although 
self-duality is not really required here, and write Xa{M) for X{A, M, M). 

Theorem 15. Let C be an adjacency preventing connection on V{E). Given 
ACE with N C M , the mapping Xa ■ 'P(E) — > V{E) is a morphological filter 
from V{E) into itself. 

Proof. For A,Mg V{E), A given and M variable, Xa{M) is the union of some 
grains and some pores of A, in such a way that each accepted pore arrived in 
Xa{M) accompanied by the whole collection of its adjacent grains. So a grains 
of 7 a; (A) that does not participate to Xa{M) does not touch any of the A- 
connected elements (grains or pores) involved in Xa{M); hence, by adjacency 
prevention, 7 a;(A) , Xa{M). By duality, 7 a;(A'^) ” M implies 7 a;(A‘’) ” Xa{M), so 
that Xa[Xa(M)] = Xa{M). 

The relevant formalism to go further is that of the activity ordering for sets 
(and no longer for set mappings)) MAT97]. As a matter of fact, any fixed set A 
generates an ordering denoted by from the two relationships 

Ml n A D M 2 n A 

Ml, M2 C E 47 Ml fi.A M2 

Ml n A° c M 2 n A^ 

From this ordering derives the so called A-activity lattice, where the supre- 
mum and the infimum of a family {Mj, t S /} of sets are given by 

gAM, = [A= n (UM,)] U [nM,] 
f^Mi = [A n (UM,)j U [nM,] 

with A itself as the minimum element, and A'” as the maximum one (a system 
very similar to that presented above about the activity lattice for operators) . In 
this framework, the following theorem holds) MAT97] )SER98b] 
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Fig. 5. a) Manet’ Joueur de fifre b)and c) levelings of a) by extended extrema of 
dynamics 30 (b) and 60 (c). 



Theorem 16. Given set A, the leveling M \a{M) from the A-activity lattice 
ofV{E) into itself are openings. Moreover, when the A-activity of A increases, 
wc have 

^\a{Mi){^2) = '^Aa(M2)(-^i) = ^a{M2) 

Ml if A M 2 ^ 

^\a(Mi){M2) = ^\a(M2){Mi) = Xa{M2) ■ 

This last granulometric type pyramid is specially usefull in practice, for it 
allows to grade the activity effects of markers: it means that we can directly 
implement a highly active marker, or, equivalently, reach it by intermediary 
steps. An example is given in fig. 5. 

2.6 Function levelings 

Let T be a discrete axis; denote by the lattice of all numerical functions 
/ : E — > T. An increasing operator E on is said to be flat if there exists an 
increasing set operator ip such that 

X[E{f),t] = f;[X{f),t] (11) 

where stands for the thresholding of function / at level t, i.e. : 

X{f,t) = {x : X e E, f{x) > t} (12) 

In the discrete cases of digital imagery, relation (11) is sufficient to charac- 
terize the function operator E associated with an increasing set operator ip. 

Definition 17. Let f,g,h, be three functions from E into T,with g < h.Then 
the relation 



X [A{f),t] = A [X{f, t )] , X{g, t),X{h, t)] 



defines one and only one leveling A{f) on T^. 
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When connection C is obtained from the iterations of an elementary dilation 
i5, of adjoint erosion e, then a digital algorithm for A{f) from the data of /, g 
and h derives from the decomposition theorem 12, by computing successively 
the opening by reconstruction goo{f) and then A{f) = hoo[goo{f)]- The first 
operation is thus given by the limit of the sequence 

9n = if ASgn-i) 
with gi = {f A Sg) 



and the second one by 



hn = [ffoo(/) V shn-i] 
with hi = [5oo(/) V eh] 

All theorems and propositions 11 to 16 of the binary case extend directly to 
numerical one. Concerning self-duality for example, if 0 and m stand for the two 
extreme bounds of the gray axis T, we have 

m - A{m - f,m- g,m- g) = A{f, g, g) 

which means that the leveling f,g^ A(f, g) is always a self-dual mapping. 
In addition, when one takes for marker g a self-dual mapping {e.g. convolution, 
median operator, etc..), then the leveling A, considered as a function of / only, 
becomes in turn self-dual, and we have 

m-g{m- f) = g{f) ^ m - A[m - f, g{m - f), g{m - f)] = A[f,g{f),g{f))] 



In practice, the role of the marker is crucial. In fig. 5, the marker is obtained by 
replacing / by zero on the extended maxima and minima of /, and by leaving / 
unchanged elsewhere {extended maxima of / : do the opening by reconstruction 
7rec(/) of / from f — k, where fc is a positive constant. Then the maxima of 
7rec(/) define the so called extended maxima of /, and those points x where 
f{x) — 7rec(/)(a^) = k define the (non extended) maxima of / of dynamics l k 
; the extended minima are obtained by duality). The corresponding levelings 
are shown in fig. 5a and 5b, for markers g^o and geo, of dynamics 30 and 60 
respectively (over 256 gray levels). 

These two markers are self-dual by construction, and satisfy the condition of 
activity increasingness of theorem 16. Their progressive leveling action appears 
clearly when confronting fig. 5a and 5b. Notice the relatively correct preservation 
of some fine details such as buttons, eyes, eyebrows, fingers, etc.. These details 
are preserved because of their high dynamics. 

In figure 6, the leveling is used for noise reduction, from a marker obtained 
by Gaussian moving average of size 5, namely fig. 6b, of the initial noisy image 
fig. 6a. It results in fig. 6c where the noise reduction of fig. 6b is preserved, but 
where the initial sharpness of the edges is recovered. 

A last word. There are two ways for developing a theory in discrete geometry. 
One can start from some Euclidean notions and adapt them to discrete spaces. 
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Fig. 6. a) noisy version, b) gaussian convolution of a, c) leveling of a) by marker b) 



or elaborate the whole approach independently of the fact that it may apply to 
a continous, or a discrete, or a finite, space E. It is this second that was chosen 
here. 

Acknowledgments : I would like to thank Dr. F. Meyer for all the dis- 
cussions about levelings, and his valuable comments on the approach adopted 
here. 
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Topological Operators on the Topological Graph 

of Frontiers 



Ehoud Ahronovitz, Christophe Fiorio, and Sylvain Glaize 
LIRMM, 161 me Ada, F-34392 Montpellier Cedex 5 



Abstract. The Topological Graph of Frontiers is in our opinion a good 
graph structure representing the topology of segmented images. In this 
paper we deal with topological operators which achieve directly on the 
graph current operations performed on segmented images. 

Well known graph structures such as the Region Adjacency Graph [Pav77] 
[Ros74] do not (and cannot) keep track of the topology and so cannot 
maintain it. We claim that the structures and operators described here, 
on the contrary, allow and do this maintenance. One of the most impor- 
tant informations in such images is the inclusion of nested regions and 
one of the most important operators is the union of regions. We deal 
essentially with these in this paper. They are described in detail herein 
and we show how the topological coherence is maintained. This is why we 
entitle them topological operators. Other operators that we have already 
developed are briefly described. 

Keywords: topological operator, enclosed region, topological graph of 
frontiers, topological representation, segmented image manipulation. 



1 Introduction 

In image analysis we need tools to structure and manipulate images and their 
regions. In previous works [Fio95,AAF95] we developed a topology (called star- 
topology) aiming at a good representation of images and a graph structure called 
topological graph of frontiers {TGF for short) [Fio96]. They lead to represent 
regions and their topological relations in 2D images. This was necessary because 
there were no representations well adapted neither to the very close study of the 
adjacency relations, nor to the inclusion of regions. 

Now we aim at topological operations and manipulations on images through 
the graph structure of the TGF. We entitle them topological because obviously 
such operations should maintain the topological coherence of the resulting image 
and of course of the associated data structures. 

Here we present essential operations on regions: we focus on the topological 
union, then we show how operations like image or region(s) cutting or subgraph 
manipulations can be achieved. In order to do such operations, we have to add to 
the TGF some attributes like the depth level of regions. We also have to classify 
and mark edges according to this depth level. So we have to study thoroughly 
the inclusion of regions. Indeed, we are led to define the enclosing of regions, 



G. Bertrand, M. Couprie, L. Perroton (Eds.): DGCI’99, LNCS 1568, pp. 207-217, 1999. 
(c) Springer- Verlag Berlin Heidelberg 1999 
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which is at a first glance the inclusion of a region not only in another region, 
but in a set of regions. 

This paper is structured as follows: first we make a short review of the TGF . 
Then the depth-weighting of the TGF and the darts (sort of half-edges, de- 
scribed in the previous part) marking are detailed in section 3. They lead to the 
topological union operation described in the next section. 

The cutting and the subgraph operations are rather too long to be detailed 
here. We only give their main features, the complete technical details are avail- 
able in [Gla98]. 

All the main topics come with algorithms for each step and we study the 
complexity when necessary. 



2 Outlines of the TGF 

The Topological Graph of Frontiers is a topological consistent representation of 
images. It is based on the star-topology [AAF95,KKM90,Kov89] and is derivated 
from combinatorial maps [Lie89,Edm60]. We give hereafter the outlines of this 
structure. 

Firstly, we have to recall that the RAG (Region Adjacency Graph) does not 
preserve or maintain all the topological information about the regions in the 
image [KM95,Kov89,Fio95]. So, if we still want to process an image through 
a graph representation, we need a more complete structure. This is why we 
developed the TGF. 

2.1 The TGF structure 

The reader not accustomed to graph structures, may follow figures 1, 2 and 3 
along with this paragraph. 

The TGF is a planar multi-graph where: 

— each vertex represents one region, 

— each edge represents one frontier between two regions and not only the ad- 
jacency (e.g. two adjacent regions having n common frontiers will yield n 
edges between the corresponding vertices.), 

— each edge is composed of two “half-edges” called hereafter darts, so that 
each dart represents the frontier as a part of the contour of one of the two 
adjacent regions, 

— the inclusion of regions is represented too. 

Note: We can see that each dart represents a part of the boundary as seen 
by the interior of the region. Thence, two adjacent regions have as many edges 
as they have different (not connected) boundaries between them, and each edge 
is split into two darts, each representing the boundary as seen from the corre- 
sponding region. This is important if we want to maintain topological coherence 
[Lie89]. 




Topological Operators on the Topological Graph of Frontiers 



209 



R 



OO 



Ri 


c 


c’ R 


3 


a’ e 


R2 

*1 r 


f b 


e 

g 


R 4 

g’ 


t 




d 


d’ 





d(-Roo) = 0 

d{Ri) = 1 
d(i?2) = 2 
d(i?s) = 1 
d(i?4) = 3 



Fig. 1. Depth weighted regions 




Fig. 2. TGF of segmented image of Figure 1 



a: {a, a') {b,b') (c, c') {d,d') 

(e,e') {f,n ig,g') 
a\ R\\ (o', c, e, d) 

R 2 -- (e',/) ; (5) 

Rs: {d'J',c',b) 

R,: ig') 
oo: 0 ; (a, 6') 



Fig. 3. TGF algebraic representa- 
tion 



We recall that [Fio96] presented a linear algorithm scanning the image with 
a 2 X 2 window and constructing the TGF along the scan. 

Let G(V, D, a, a) be a TGF, V is the set of vertices, D the set of darts, a a 
permutation on D and a an involution on D. The set of edges will be denoted 
E (see hereafter). The features of such a graph are: 

1. Each region is represented by one and only one vertex. 

2. Each dart e is always incident to a vertex R in V. We say that e is incident 
to region R and denote the vertex incident to e. 
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3. Each edge is an unordered couple (e, cr(e)) where e G D. Let E be the set of 

edges; then (ef , ) G E iS there exists a frontier between R and R' . 

4. a is a permutation on D whose cycles are circular permutations on a set 
of darts incident to the same vertex. Each cycle represents one contour of a 
region and follows exactly the order induced by the set of frontiers composing 
the contour. Thus following the darts linked in one a cycle, consists also in 
following one entire closed contour of the region. 

5. Roo G E is a particular vertex which represents the exterior of the image. 
It is somewhat a region containing all the other regions of the image and 
having no exterior contour. 

6. We associate to each vertex the list of a cycles corresponding to the region 
represented. The outer contour (exterior) will be the list head except for i?oo 
(no exterior contour). 

Figure 2 and 3 show the graphic and algebraic versions for the TGF associ- 
ated to the segmented image of Figure 1. Note the list of a cycles and the void 
cycle in i?oo which has no exterior contour. 

2.2 A first point 

Advantages of the TGF lie essentially on the multi-graph modelizing the frontiers 
and the inclusion of regions. However, it lacks the structure of enclosed regions 
(deeper than the only one level of inclusion). In the next section we show how 
we added this feature. 

Now, we can define the inclusion of regions by the following: 

~ A region A is said to be strictly included in a region B iff there exists an 
edge between A and B such that the incident dart (of its edge) to A is in an 
interior contour of B. We note A ' B. 

— A region A is said to be included in a region B iff there exist a sequence 
(i?i, . . . , Rn) of regions such that A ' Ri Rn ' B . We note A C B. 

Note that this last inclusion relation defines a complete order relation. 

The TGF structure is quite sufficient to check the inclusion of regions. However, 
if we want to manipulate set of regions, or if we need to know wether a region 
is enclosed (strictly defined later) in a set of regions, we have to complete the 
information joined to the TGF . 

The next section deals with that problem and labels regions with a depth 
information and darts with an inclusion mark. 

3 Depth-weighting in a TGF 

In order to distinguish exterior and interior regions in a TGF, we associate to 
each vertex a depth parameter relative to region i?oo- We denote d{R) the depth 
of region R, and N{R) the set of the neighbouring regions of R. A neighbouring 
region is a region having a common frontier with R. 




Topological Operators on the Topological Graph of Frontiers 211 



The depth parameter can be defined by d{R) = + 1 

A TGF is said to be depth weighted if every region owes a depth parameter. 
In order to depth weight the TGF , we apply simple breadth search algorithm to 
the TGF starting with i?oo- This is rather easier than trying to weight during the 
construction of the graph. Figure 1 gives an example of regions depth weighed. 

3.1 Dart marking 

The dart marking operation will allow us to make a classification of the darts 
relatively to a set of regions. Then the classification will lead to determine regions 
enclosed in a set of regions. As this notion is important for operations like the 
topological union or subgraph extraction, let us first give a definition for the 
union of regions. 



Simple union definition: The first type of union is called simple union. It is 
valid on any set of linked regions, that is to say that for any two regions A and 
B of the set, there exists a path from a dart incident to A to a dart incident 
to B. The path may be any combination of the a and cr involutions {orbite in 
combinatorial maps). 

Let V denote the set of vertices of the linked regions R\, . . . , Rn- Then the 
simple union IJ i? of i?i, . . . , is G{V , D' , a' , a') where D' is the union of all 
incident darts to the vertices in V' and a' and a' the restrictions to D' of a and 
a. 

Note that this is only a mean to denote a set of regions in a TGF. It may 
not denote a region, as it may contain a dart d such that a{d) G D' , or even 
have two darts towards a same region for only one frontier. 

The classification of darts studied in the next section will lead us from the 
above simple union to the topological union of regions which in turn results in 
a valid region and maintains the coherence of the graph representation. 



Classification of darts The classification that we make separates the darts 
into three classes: exiting, enclosed or entering^: 

— exiting darts are defined and marked by Algorithm 1. Intuitively such darts 
lead to regions of lower depth than the depth of the regions simply united, 
that is to the outside. 

The principle of the algorithm is to find a dart of depth n — 1, n being 
the minimal depth of the simply united regions, then to examine the darts 
without going into the set of simply united regions. 

Darts are examined in the order induced by a as long as no enclosed dart is 
found (see Algorithm 1). In this case we proceed to the next region according 
to cr. Exiting darts always exist except if i?oo is in the simple union. In this 
case, exiting darts make no sense. 

^ Important remark: Note that this classification is made relatively to a given set 
of simply united regions. If this set is changed, then the classification changes too. 
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Algorithm 1: Marking of exiting darts 



Data: a weighted TGF G{V, D, a, a), the set U of vertices of the simply united 
regions, 

Pre-condition: Roo ^ U 

Result: a weighted and marked TGF G{V, D' , a, a) where the elements of D' 
are those of D with a mark on the exiting darts relatively to U. 

Punion ^ minup{R)\ 

/* find the first exiting dart */ 

b ■<— a. dart such that p{a{b)^) > Punion and cr{b)^ (f: S\ 

binit ^ b\ 

marqueib) <— “exiting”; 
repeat 

b ^ a(b)-, 
if (7(6) e S then 

/* move to next region */ 
b ^ a{b) ; 
else 

|_ marqueib) <— “exiting”; 
until b == binit', 



— enclosed darts: a dart b is enclosed if a{b) G IJi?. Such darts exist if the set 
y i? of simply united regions contains at leat two regions. Intuitively, they 
belong to edges joining two vertices of the simple union. 

Such darts should not exist in a TGF . So an algorithm making a topological 
union (see Section 4 for the definition) should discard them. 

A simple algorithm can look for non yet marked darts and mark as enclosed 
those that verify the condition. It is also possible to include this marking 
in the exiting darts algorithm (see Algorithm 1): when moving to the next 
region (6 ^ cr(6)), mark b and cr{b) as enclosed. 

— entering darts can be defined as the incident darts to y i? (incident to any of 
the simply united regions) that are neither exiting nor enclosed. Intuitively, 
they lead to a deeper region (depth parameter higher) than the set of simply 
united regions. 

If such darts exist, it means that there are regions “included” in the set 
y R. This will be a good way to define and recognize “enclosed” regions (see 
below) . 



Marking algorithm for a TGF: As seen before, all the incident darts to a 
set of simply united regions can be marked according to one of the three classes. 
What about the unmarked darts ? They are not adjacent to any region concerned 
by the union, so that we mark them “unconcerned” . 

Note that the algorithm does not modify neither te vertices nor the involu- 
tions. It merely adds a marking information to all the darts. 
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Marking example: Figure 2 represents the TGF for the segmented image of 
Figure 1. Suppose that R\ and i ?3 are simply united. The depth of the union 
is 1 as d{R\) = d{Rz) = 1. The initial dart is then a' or b' as 
cr(a')^ = Rao and d{Rao) = 0 (lower depth level) and likewise for a{b')^. 

Let a' be the initial dart. Then a' is marked exiting and we proceed to 
a{a') = c. As cr(c) = c' and c'^ G c and c' are enclosed. Then as 

a(c') = b', b' is exiting; a{b') = d' and a{d')^ G Hence d and d' are 

enclosed. Finally, a{d) = d'; the exiting darts are all marked; there are no more 
enclosed darts, therefore e and /' are marked entering and all the other darts 
are unconcerned. 

We are very close now to be able to define the topological union and the 
enclosure of regions. Thus, in the previous example, it will lead us to claim that 
i ?2 is enclosed in the topological union of i?i and R 3 . 

4 Topological union of regions 

The topological union of regions consists in replacing a set of linked regions by 
one region, while keeping coherent the TGF structure. 

When the classification (marking) process of the darts relative to a given 
simple union ends, we can easily deduce the region achieving the topological 
union. It is sufficient to: 

~ delete all the enclosed darts; 

— keep only one edge per frontier so that: 

• for a frontier with an exterior region the edge is relative to an exiting 
dart, 

• for a frontier with an interior region the edge is relative to an entering 
dart; 

— change all remaining entering darts to interior contours. 

Note that during all this process, we have to maintain the a and a coherence. 

Figure (4) shows the result of the topological union of regions i?i and i ?3 in 
figure 2, where: 

— enclosed darts c,d,d and d' are discarded, 

— exiting darts a and a' (or b and b') are deleted in order to keep one edge to 

.^00 1 

— entering darts e and e' (or / and /') are deleted in order to keep one edge 
to i? 2 , 

~ / and /' (or e and e') are changed to an interior contour. 

Before the review of the topological union algorithm (Algorithm 2), we can 
define an enclosed region. 

A region A is said to be strictly enclosed in a set S of regions iff there exists 
an edge between A and the topological union of S such that the corresponding 
incident dart to A is in an interior contour of the topological union. We denote 
it by A ◄ S. 
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Fig. 4. left: before topological union; right: after 



A region A is said to be enclosed in a set S of regions iff there exists a sequence 
(S'!, S'2, 5 „) of sets of regions such that and ySl^ G ... 

ySn-ii G Sn-1, Sn-ii < Sn ••• VS'„j G S'„, Sm < S. Wc denote it by A C S. 

The algorithm making the topological union lies heavily on the darts marking 
one. It does not modify directly the darts of the TGF as it needs the TGF 
structure during the whole marking process. Hence, for example, we do not 
discard enclosed darts as soon as they are found. 

Algorithm 2 summerizes the topological union process. A complete descrip- 
tion can be found in [Gla 98 ]. Note that at the end of the process, a new depth- 
weighting of the TGF is necessary. Indeed, after a topological union, the depth 
level is no longer true: 

Let i?i, i?2, R3 be three regions such that d{Ri) = d(i?2) — 1 = n, i?i and R2 
being neighbours, but i?3 being a neighbour of R2 only. Thus, n < d(R^) < n+ 2 . 
The depth of Ru, the topological union is n. Then R2 and Ru will have a 
discrepency of 2 in their depths, which is wrong. 

In order to avoid a complete computation of the depth, we have to check 
the depth level coherence between the exterior and all interior contours of the 
topological union Rjj. If a region Rf with a common frontier with Ru has a 
wrong depth level, then the depth of all the linked regions to Rf should be 
recomputed {Ru being excluded from this action). 



4.1 Complexity 

The algorithm scans all the darts of the simple union. Each dart is checked once, 
except the first one for each processing of algorithm 1 . Hence, the complexity is 
0 (n. (/))), n being the number of darts in the simple union and </> the complexity 
to access one element in the dart list. 
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Algorithm 2: Topological union of regions 



Data: a weighted TGF G{Vi, Di,ai,o\), a set of vertices U {Roo ^ U). 
Result: a weighted TGF G{V2, 02,012,02) in which the regions in U are topo- 
logically united into one region. 

Initialization (building Ru)', 

Select the first (exiting) dart; 
repeat 

check next exiting dart; 
if dart is redundant then 
I mark redundant 
else 

if dart is enclosed then 
I mark enclosed 
else 

mark exiting; 

|_ add to Ru’, 

until current dart == starting dart’, 
while there exists a non marked dart do 
select this dart; 
if dart is enclosed then 
I mark enclosed 
else 

/* entering * / 

check all associated entering darts; 
add them to Ru’, 

/* All concerned darts are checked */ 
discard the simply united regions; 
update a and cr; 
recheck depth level; 



5 Other operators 

We defined also a cutting operator as well as extracting parts of images along 
with their subgraph structure. 

For example the cutting operator allows to cut a region into two different 
disctinct regions (see Figures 5 and 6). The problem is that we cannot cut a 
region according to any given path (see Figure 7). 

[Gla98] contains a detailed study of valid interpixel paths (see Figure 8) for 
the cutting operator. The algorithm principles are rather long. Figures 5 to 8 
give a slight idea of how the algorithm works and how it checks the validity of a 
cutting path. 
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Fig. 5. Cutting path 
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Fig. 6. Regions after cutting 
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Fig. 7. Non valid cutting paths 



6 Conclusion 

We have defined new topological operators on the TGF . They enlarges the use of 
such a graph structure in image analysis. We can now merge regions or cut them, 
extract parts of images or include such elements, while the topological coherence 
is maintained. This last fact is very important and makes the difference with 
many classical operators that do not take it into account. 

Essentially, we showed in this paper that the TGF is a well-suited struc- 
ture for topological operations. The depth-weighting and the dart marking are 
basic processes that lead to a deep view of the inclusion of regions and allow 
sophisticated search paths in an image. 






1 1 


Cl 


1 1 
1 1 



Fig. 8. Valid cutting paths 
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We can decide to go into a set of regions by an entering dart, scan it com- 
pletely or partially by a or a, then skip to another region or set of regions. In 
fact, we can repeat such operations as needed, due to the topological coherence. 

We have to acknowledge that we have not yet thought about similar 3D 
operations since defining such a structure for 3D image is still under research. 
We focus currently on new operators, for example the minimal set of regions 
containing a region or linking non adjacent regions. 
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Abstract. Some new notions based on orders and discrete topology are 
introduced. We investigate the notions of unipolar and free points, we 
propose some discrete definitions for homotopy and a generalization of 
the notion of simple point. 
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1 Introduction 

Several approaches have been proposed for the study of topological properties 
of binary digital images: 

- the digital topology approach introduced by A. Rosenfeld ([15], [11]). Ele- 
ments of Z" are linked by some adjacency relations which allow to define connect- 
edness. There is no straightforward way to build a topology which corresponds 
to this notion of connectedness ([6], [2]). 

- the connected ordered topological space (COTS) approach introduced by E. 
Khalimsky ([7], [8], [10]). The smallest neighborhood of each point of Z” differs 
from one point to another. This allows to recover the structure of a topology. 

- the complex cellular approach. An object is seen as a structure consisting of 
elements of different dimensions called cells. As noticed by V. Kovalevsky [12], 
it is also possible, with this approach, to recover the structure of a topology. 

The topology which is used in the last two approaches is an Alexandra ff or a 
discrete topology [1] which is a topology such that every intersection of open sets 
is open. There is a link between Alexandroff topologies and (partially) ordered 
sets (orders), i.e., sets on which a reflexive, antisymmetric, and transitive binary 
relation A is deflned. Recall that a topological space E is To-separable if, for any 
two elements of E, there exists an open subset O such that exactly one of the 
two elements is in O. We may define an order on the points of any To-separable 
topological space. This order is such that x ^ y if and only if x is in the closure 
of {y}. Conversely, an order determines an Alexandroff topology: a set O is open 
for this topology if and only if x G O and x ^ y implies y G O. 

This paper is the first of a series which investigate some new notions for 
discrete topology. We introduce the notion of unipolar point which may be seen 
as the simplest “inessential” element. We propose some discrete definitions for 
homotopy, and show there is a link between retraction and unipolar points. Then 
we present a generalization of the classical notion of simple point. 



G. Bertrand, M. Couprie, L. Perroton (Eds.): DGCI’99, LNCS 1568, pp. 218-228, 1999. 
(c) Springer- Verlag Berlin Heidelberg 1999 
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2 Basic notions 

If X is a set, ViX) denotes the set composed of all subsets of X, if S' is a subset 
of X, S denotes the complement of S in X. If S is a subset of T, we write S CT, 
the notation S C T means S CT and S yf T. If 7 is a map from ViX) to V{X), 
the dual of 7 is the map *7 from V{X) to ViX) such that, for each S C X, 
*7(S) = 7(S). Let 0 be a binary relation on X, i.e., a subset oi X x X. The 
inverse of 9 is the binary relation {{x,y) G X x X; {y, x) G 9}. We also denote 9 
the map from X to ViX) such that, for each x of X, 9{x) = {y G X, (x, y) G 9}. 
We define 9° as the binary relation 9° = 9 \ {{x,x)]x G X}. Let xq and Xk be 
two elements of X. A 9 -path from xq to is a sequence xq, xi, x^ of elements 
of X such that Xj G 9{xi-\), with z = 1, k] the number k is the length of the 
path, xq is its origin. 



2.1 Orders and discrete topology 

An order is a pair |AT| = (A, a) where A is a set and a is a reflexive, antisym- 
metric, and transitive binary relation on A. An element of A is also called a 
point. The set a{x) is called the a-adherence of x, if j/ G a{x) we say that y is 
a-adherent to x. We also denote a the map from V{X) to 7^(A) such that, for 
each subset S of A, a{S) = U{a(x); x G S'}, a{S) is called the a-closure of S, 
*a(S) is called the a-interior of S. A subset S of A is a-closed if S = a(S), S 
is a-open if S = *a(S). 

Let (A, a) be an order. We denote /3 the inverse of a. Sometimes, in order 
to recall this notation, we also call an order any triplet (A, a,/3) where (A, a) 
is an order as defined above and (3 is the inverse of a. The dual of the order 
(X,a,P) is the order (A, /?, a). Note that *a(S) = {x G S; /3(x) C S|, and 
*/3(S) = {x G S; a(x) C S|. 

The set Oa composed of all a-open subsets of A satisfies the conditions for 
the family of open subsets of a topology, the same result holds for the set 
composed of all /3-open subsets of A; we denote respectively Ta = (A, Oa) and 
7^ = {X,Op) these two topologies. These topologies are Alex andr off or discrete 
topologies [1], i.e., topologies such that every intersection of open sets is open. 
The closed subsets for Ta (resp. 7^) are the a-closed (resp. /3-closed) subsets as 
defined above. The a-adherence (resp. /3-adherence) of a point x is the smallest 
neighborhood for the topology 7)3 (resp. Ta). 

When (A, a) and (A', a') are orders, a map / from A to A' is order pre- 
serving or isotone if y G a(x) implies f{y) G a'[/(x)]. We say that / is an 
isomorphism from (A, a) to (A', a'), if / is a bijection so that y G a{y) if and 
only if f{y) G a'[/(x)], for all x, y in A. 

Let (A, a,/3), (A',a',/3') be two orders and let / be an isotone map from 
A to A'. Let us consider an element U' G Oa'. Let x be a point of f~^{U'), 
that is /(x) G [/', and let y G /3(x). We have f{y) G /3'[/(x)], hence f{y) G U' 
and y G f~^{U'). Thus, f~^{U') G Oa. So, the inverse of any a'-open set is an 
a-open set: any isotone map is continuous. The converse may be easily verified: 
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If (X,a) and {X',a') are orders, a map f from X to X' is isotone if and only 
if f is continuous with respect to Ta and 'Ta' ■ 

If {X, a) is an order and S' is a subset of X, the order relative to S is the 
order |S| = (S, a n (S x S)); we denote as = aC\ {S x S)). When no confusion 
may arise, we also denote |S| = (S, a). 

2.2 CF-orders 

An order {X, a, ff) is countable if X is countable, it is locally finite if, for each x 
of X, [a U /3](x) is a finite set. A CF-order is a countable locally finite order. If 
|A| = (A, a, (3) is a CF-order, a path in |A| is a 0-path with 0 = a U /3. 

It may be seen that a CF-order {X,a,j3) is connected for 7^ (or for Tp) if 
and only if it is path-connected, i.e, if for all x, y in X, there is a path from x 
to y. 

Let (A, a) be a CF-order. An element x such that a°{x) = 0 is said to be 
a-terminal. A point y is an a-terminal of x if y is an a-terminal and y G a(x). 
If (X,a,/3) is an order we define a* as the relation on A such that y G a*{x) 
if and only if y G a°{x) and a°{x) D !3°{y) = 0. The set a*{x) is called the 
a-closeness of x, if y G a*{x) we say that y is a-close to x. 

Note that a* is antireflexive (Vx GX, x a*(x)), antisymmetric, and antitran- 
sitive {ffx,y,z G A, if y G a*{x) and 2 ; G a*{y), then 2 ; ^ a*{x)). In fact, a is 
the smallest reflexive and transitive relation containing a*. We also note that 
the relation /3* is the inverse of the relation a*. 

Let S' be a subset of A. It should be pointed out that, in general, a* is different 
from a* n (S X S). In fact, if x and y are points of S, y G a* if and only if 
y G a{x) and o;°(x) D !3°{y) C S. Thus if x, y G S and if y G a*(x), then 
y G a* (x), the converse is, in general, not true. 

Let |A| = (A, a) be a CF-order. The a-decomposition of |A| is the family 
T = {Aq, Ai, ..., Afc} such that: 

i) Aq is composed of all a-terminals; and 

ii) Xi is composed of all as-terminals, with S = A \ UjA^; j = 0, ..., i — 1}; and 

iii) Afe yf 0 and A = U{Ai; t = 0, ..., k}. 

The a-rank of a point x is the number ra(x) = i such that x G Ei. 

The family IF is a disjoint family of non-empty subsets of A whose union is A: 
IF is a partition of A. 

We note that ra{x) is the length of a longest a°-path having x as origin. 

If y G a(x) and ra{y) = ra(x) — 1, then y G a*(x), the converse is, in general, 
not true. 

3 Unipolar and free points 

We introduce the notion of unipolar point. Intuitively an unipolar point may 
be seen as the “simplest inessential” element of an order, in the sense that its 
removal does not “change the topology” of the order. The introduction of the 
notion of free point is motivated by the Property 4 which indicates that the 
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adherence of such points has a specific structure (see [5] for a presentation of 
this structure). 

Definition 1: Let (X,a) be a CF-order. We say that a point a; of X is 
a-unipolar\i a*{x) consists in exactly one point. 

Thus, a point x is a-unipolar if and only if there exists a point y € X such 
that a°{x) = a{y). 

Definition 2: Let |X| = {X,a) be a CF-order and let x be a point of X. 
We say that x is a- free if x is a-unipolar or if there is a sequence xo,...,Xk with 
Xk = X, such that xq is a-unipolar and Xi is Oi-unipolar, with Oi = aC {Si x Si), 
Si = X \ {xq, ...,Xi-i\, i = A point which is not a-free is called an 

a-link. The a-kernel of X is the subset of X composed of all a-links of X. Two 
orders {X,a) and (X',a') are said to be a-equivalent if the orders relative to 
their a-kernel are isomorphic. 

If there is a unique point y which is the a-terminal of x, with y ^ x, then 
it may be seen that x is necessarily a-free. The converse of this property is not 
true. 

Property 3: Let {X, a) be a CF-order, f be an a-unipolar point, and let 
X be a point of X, with x yf /. If x is a-unipolar, then x is as-unipolar for 
S = X\{f}. 

Proof: Suppose x is a-unipolar. We denote x' (resp. /') the unique point of 
a*(x) (resp. a*(/)). 

i) Suppose / yf x'. Then x' G a* (a^)- Suppose there exists a point y, with y yf x' 
and y G a* (x); thus ag(x) fl (3g{y) = 0 . We must have a°(x) fl (3°{y) yf 0 , this 
implies that a°(x) fl (3°{y) = {/}, and a°(x) fl P°{f) = 0 , / would belong to 
a*(x) which is impossible. Thus x is ag-unipolar. 

ii) Suppose / = x' . Then it may be seen that we must have a* (x) = {/'}. □ 

Note that the converse of Prop. 3 is, in general, not true: if x is ag-unipolar 

for S' = X \ {/}, then x is not necessarily a-unipolar. 

Property 4: Let (X, a) be a CF-order, f be an a-unipolar point, and let x 
be a point of X, with / yf x. The point x is a-free if and only if x is ag-free for 
S = X\{f}. 

Proof: If X is a-unipolar we are in the conditions of Prop. 3. If x is a-free 
and not a-unipolar there is a sequence xq, ..., Xfe with Xk = x, such that xq is a- 
unipolar and Xi is a a^-unipolar, with a^ = a n {Si x Si), Si = X \ {xq, ..., Xi_i}, 
i = Suppose / yf Xi, with i = l,...,k. By using k times Prop. 3, / 

is ai-unipolar,..., afc-unipolar. Let S = X \ {/}, S' = S \ {xq, ..., Xi_i}, a' = 
a n (S' X S'). The point Xq is as-unipolar, Xi and / are a-unipolar, thus Xi is 
a'-unipolar, i = I , ..., k (Prop. 3). Hence Xk is as-free. 

If / = xo, Xk is as-free, suppose / = Xi, for some i = 1, ..., k. The point xq is 
as-unipolar, if 1 < j < t, xj is a'-unipolar (by the preceding analysis) and if 
i < j < k, trivially Xj is a'-unipolar. Thus Xk is as-free. 

On the other hand, by the very definition of a-free points, if x is as-free, with 
S = X \ {/}, then x is a-free. □. 

We may reformulate the Prop. 4 in the following way: if (X, a) is a CF-order 
and if X G X is a-unipolar, then the as-kernel for S = X\{x}, is precisely the a- 
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kernel. This indicates a way for computing the a-kernel: we choose arbitrarily an 
a-unipolar point and we remove it; we repeat this operation until idempotence. 

Since, in a CF-order, connectedness is equivalent to path-connectedness, we 
see that, if we remove from X an a-unipolar point, we do not change the num- 
ber of connected components of X. Thus, by induction, we have the following 
property: 

Property 5: A CF-order {X, a) is connected if and only if the order relative 
to its a-kernel is connected. 

4 Collapse and free points 

Let if be a set. An (abstract) simplicial complex on if is a family if of finite and 
non-empty subsets of if, called simplexes, such that (see [14]): 

i) if a; e if, then {cc} S IF; and 

ii) if S' G IF, then each non-empty subset of S belongs to T . 

If S G has precisely A: -I- 1 elements {k > 0), k is the dimension o/S. If if yf 0 
the dimension n of IF is the largest k such that IF contains an n-simplex. Any 
T C S, with S, T G IF is a face of S, if T is a proper subset of S, T is o proper 
face of S. A principal simplex of IF is a simplex which is not a proper face of any 
simplex in T . For any S ^ T ^ & free face of S is a proper face i? of S which is 
not a proper face of any simplex of IF besides S. If T has a principal simplex S 
which has a free face T, the family ^ = 1F\{5', Tjisa simplicial complex, we 
say that this complex has been obtained from T by an elementary collapse. A 
sequence of elementary collapses is a collapse. 

To any simplicial complex IF on if we may associate the CF-order |A| = 
(X,a,(3), with X = T and a =A, thus G G a{F) ii F A G. It may be seen 
that a principal simplex is a /3-terminal, a free face i? is a /3-unipolar element. 
Nevertheless, the notion of unipolar point in an order as it has been introduced 
is different from the notion of free face used in simplicial complexes. In simplicial 






Fig. 1. Kernels of an order associated to a simplex 
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complexes we cannot remove free faces in “parallel” (the resulting object would 
not be a simplicial complex), so it is not possible to define, for example, the 
equivalent of the notion of a-kernel. In Fig. 1 (1), we give an illustration of 
an order |X| associated to a 2D simplicial complex: we have E = {a,h,c,d,e\, 
^ = {{a}. {H. {c}> {d}, {e}, {a, b}, {b, c}, {b, d}, {c, e}, {c, d}, {d, e}, {6, c, d}, 
{c,d,e}}. An element x of E which contains 1 (resp. 2, 3) elements of E is 
represented by a black dot (resp. segment, triangle). The order |A| contains no 
a-free points. In Fig. I (2), the /3-kernel of |A| is depicted, in Fig. 1 (3) the a- 
kernel of the precedent order is given, then we have the /3-kernel of the precedent 
order (Fig 1 (4)) which consists in the element {b, c, d} which is not a simplex. We 
see that the notion of dimension is different from the one which is often associated 
to orders and which corresponds to the dimension of simplicial complexes. In this 
framework, an element of an order is considered as a 0-dimensional entity. 

5 Some orders associated to 2 ^ 

We give now a presentation of some orders which may be associated to Z", these 
orders are equivalent to the ones obtained in the framework of connected ordered 
topological spaces introduced by E.D. Khalimsky [7]. 

Let if be a set and let if" be the Cartesian product of n copies of E. An 
element a of if” may be seen as a map from {!,... ,n} to if, a{i) is the t-th 
coordinate of a, / = 1, ..., n. If S' is a subset of if", the i-th projection of S is the 
set S(i) = {a(z); a € S}, i = 1 , ..., n. 

oOoOoOoOoOo 

oOoOoOoOoQo 

oO«#oOoOo0o 

oOo#oOoOo#o 

oO»OoO»0»0» 

oOoOoOo#oOo 

oOoOoOoO»Oo 

oOoOoOoOoQo 

oOoOoOoOoOo 
(a) (b) 

Fig. 2. An object in the set representation (a), and in the array representation (b) 



Let Z be the set of relative integers. We consider the families of sets Hq, Hi, 
such that, iiQ={{a}; a & Z}, H\ = {{a,a + l}-, a € Z}, = iig U ii| . A 

subset S of Z" which is the Cartesian product of exactly m elements of Hi and 
(n — m) elements of H^ is called a m-cube of Z", we denote ii" the set composed 
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of all m-cubes of Z", m = 0, n. A m-cube of Z” is called a singleton if m = 0, 
a unit interval if m = 1, a unit square if m = 2, a unit cube if m = 3. 

We denote *Z = {a + 1/2; a G Z} and Z+ = Z U *Z. We consider the map 
if'i on to Z+ such that, for each x of , 'l^i{x) = a \i x = {a}, a G Z and 
= a + 1/2 if x = {a, a + 1}, a G Z. 

We also consider the map on iJ" to Z" such that, for each x of the 
i-th coordinate of ^n{x) is the image of the z-th coordinate of x under •f'l, 
i.e., [tf'„(x)](z) = ^i[x(i)], i = Note that 'f'n is a bijection. If S' C -ff”, 

if'„(S) = {'Fn{x),x G S} may be seen as an array representation of S. 

We consider the basic order associated to Z" as the order where 

a =Z, thus y G a{x) ii x Z y. Let us examine the Fig 2 (a) where a subset S 
of is depicted, using the usual representation for sets. The same subset S 
is represented in Fig. 2 (b) as an array representation. We see that the subset 

5 is made of two connected components Si and 82 - We note that Si has two 
a-terminals, one a-free point, one /3-terminal, three /3-free points. The set Si is 
/3-equivalent to an isolated point. 

6 Homotopy and deformation retract 

We propose some definitions relative to homotopy which may be seen as dis- 
crete equivalents of the classical continuous ones [14]. In order to make such a 
transposition, we first give few preliminary notations. 

Let |A| = (X,a) and |A'| = {X',a') be two CF-orders. The Cartesian 
product of the two orders |A| and |A'| is the order \X x X'\ = (A x X',a x a') 
where a x a' is the relation on A x A' such that for each {x,x') G A x A', 
(a X a'){x,x') = {{y,y') G A x A'; y G a{x) and y' G a'(x')}. We define 
Ik = {{d) {h * + l}i * G Z and 0 < z < k}, with k G Z,k > 0. We consider the 
CF-order \Ik\ = {Ik, 2), \Ik\ is the order associated to an interval of the set of 
integers. 

Let |A| = (A, a,/3) and |A'| = (A',a',/3') be two CF-orders. As in the 
continuous framework, we could say that two isotone maps f, g: X ^ X' are 
homotopic if there is a homotopy h between them, z.e., an isotone map h : 
A X Jfc — > A' with h{x, {0}) = f{x) and h{x, {k, fc -I- 1}) = g{x) for all x G A, 
and for some A: > 0. In fact h is isotone if Vx G A, Va G Ik, if z/ G a(x) 
and a Z 6, we have h{y,b) G a'[h{x, a)]. Let us denote hi = h{x,{i}), and 

K+ 1/2 = h{x, {*, i + 1}), i = 0, ..., k. 

i) If a = {z}, with z > 0, then we have b = {z}. The condition is reduced to 
hi{y) G a'[hi{x)], we get the isotone condition for hi. 

ii) If a = {z,z -I- 1} with z > 0, then we have b = {z}, b = {z -I- 1}, or 6 = 
{z,z -I- 1}. If 6 = {z,z -I- 1} we get the isotone condition for /zi+i/ 2 - It remains 
the two conditions hi{y) G cii'[hi+i/ 2 {x)], and hi+i{y) G a'[hi+i/ 2 {x)] which are 
equivalent to h^+i/ 2 {x) G I3'[hi{y)\, and hi+i{y) G a'[/ii+i/ 2 (a;)]. 

This leads to the following definitions: 

Definition 10: Let |A| = {X,a,(3), |A'| = {X',a',f3') be two CF-orders and 
let / and g be two isotone maps from A to A'. We say that g is a-homotopic 
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to / if Vy € a{x), g{y) G a'[f{x)]. We say that / and g are homotopic if there 
is a homotopy between them, i.e., a sequence of isotone maps from X to X'\ 
/iQ, hk, with ho = f, hk = g, and such that hi is a- or /3-homotopic to hi-i, 
with z = 1, k. In this case we write f — g. 

Definition 11: Let \X\ = {X,a,f3), \X'\ = {X',a',(3') be two CF-orders. 
An isotone map / from X to X' is called a homotopy equivalence between X and 
X' if it possesses a homotopy inverse, i.e., an isotone map g from X' to X with 
(/ o / ~ Idx and f o g ~ Idx' • In this case we say that X and X' are homotopy 
equivalent. A CF-order is contractible if it is homotopy equivalent to an order 
which consists solely in one point. 

It may be seen that a CF-order is contractible if and only if there is a constant 
map c : X ^ {a}, with a G X, such that c and Idx are homotopic. 

Definition 12: Let |A| be a CF-order and let S C X. The subset A is a 
retract of X if there is a retraction, i.e. an isotone map from A to S' such that 
r{x) = X for each x G S. If a, retraction r is homotopic to the identity as a 
map from X to X, then r is called a deformation retraction and S is called a 
deformation retract of X. If this homotopy ho, ..., hk between r and the identity 
can be chosen so that hi{x) = x for all a; G S and all i with 0 < z < fc, S is called 
a strong deformation retract of X. 

Definition 13: Let |A| be a CF-order, S C A be a retract of A and r be the 
corresponding retraction. If r is a-homotopic to the identity as a map from A 
to A, i.e., if Vx G A, Vy G a{x), r{y) G a{x), then r is called an a-deformation 
retraction and S is called an a-deformation retract of X. 

Let So, ..., Sk be a sequence of subsets of A, with So = X, Sk = S, and let 
To, ...,rk, ro = Idx, fk = Ids, be a sequence of maps such that ri is an a- or a 
/3-deformation retract on Si, Si being the image of Si-\ under r^-i. Then it may 
be seen that no ...oro is homotopic to ri_i o ... o tq, therefore S is a (strong) 
deformation retract of A. 

We can associate to each a-unipolar point a an elementary a-deformation 
retraction. Let a* (a) = {6}: we can consider the map r with r(a) = b and 
Vx G A \ {a}, r(x) = x. In fact, we also have: 

Property 14: Let |A| = (A, a) be a CF-order and let r be an a-deformation 
retraction. If r{x) yf x, then x is an a-free point. 

Proof: We have Vx G A, Vy G a(x), r(y) G a[r(x)] and r(y) G a(x). 

Let X be a point with r(x) yf x, we denote y = r(x), we have y G a°{x). The 
a-rank of x cannot be 0. If the a-rank of x is 1, we easily see that x is necessarily 
a-unipolar. Thus the property is true for all points x with ra{x) < 1. Suppose the 
property is true for all points x such that ra{x) < (z — 1). Let x with ra{x) = i. 
Let us consider the set Y = a°{x) \ a{y). By isotony of r, all points z in Y are 
such that r{z) G a{y), thus r(z) yf 2 ;. All these points have an a-rank lower than 
X, by hypothesis, they are a-free. We have a* (x) = {y}, with S = X\Y. Let us 
denote Z the set of points which appear in the sequences corresponding to the 
definition of free points (Def. 2) and relative to the points of A. It may be seen 
that we have a^{x) = {y'}, with T = X \ Z and for some y' G a{y). Thus the 
point X is ay-unipolar. □ 
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7 Simple points 

We propose now a notion of simple point based on contractible subsets. 

Definition 15: Let |X| = {X, a) be a CF-order. A point cc of A is a-simple 
for \X\ if a°{x) is contractible. 

Let |A| = (X,a) and \X'\ = {X',a') be two CF-orders. We say that \X'\ is 
lower a-equivalent to |A|, or that \X\ is upper a-equivalent to |A|', if there is 
a sequence Xo,...,Xk, with Xq = X, such that Xi is obtained from Xi-i by 
removing an a- or a /3-simple point for |Ai_i|, i = and such that \X^\ 

is isomorphic to |A'|. We say that |A| and \X'\ are a-equivalent, if there is a 
sequence |Ao|, ..., |Afc|, with |Ao| = |A|, \Xk\ = \X'\, such that \Xi\ is lower or 
upper cr-equivalent to |Ai_i|, i = 1, ..., k. 

Let us consider Fig. 3 (a) where an order is depicted. Black points and 
segments constitute the elements of the order |Ai|, as usual we consider that 
y G a{x) if X 3 y. The only difference between the order X 2 of Fig. 3 (b) and 
Xi is an element s which is represented by a square. We see that a°(s) is con- 
tractible: there is a sequence of a- and /3-unipolar points which reduces a° to a 
single point; so X 2 is upper cr-equivalent to Xi. We note that all elements inside 
s are /3-simple except the four corners and that the order A3 (Fig. 3 (c)) is lower 
cr-equivalent to X 2 , A4 (Fig. 3 (d)) is upper a-equivalent to A3, we see that we 
cannot remove the square s since o;°(s) is not contractible. The order A5 (Fig. 
3 (e)) is upper a-equivalent to A3 and we can remove s which leads to Ag (Fig. 
3 (f)). We note that this notion of equivalence allows to consider objects with 
different cardinality and thus it takes into account a difficult problem in discrete 
spaces. We also see that the intersection of several curves may be “moved” with 




Fig. 3. Illustration for simple points 
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this notion of equivalence. Having this last example in mind, it is also possible to 
check that a “dunce hat” is a-equivalent to a point. Dunce hat is a well-known 
counter-example for collapse operation. The dunce hat is an object which may 
be realized in a three-dimensional space and which has “no holes and cavities” , 
z.e., its homology groups are all trivial. At last it could be seen that it is possible 
to “simulate” the collapse operation with the notion of simple point. 

Several more specific notions of simple points may be derived from the pre- 
ceding definition including some previous notions of simple points ([3], [9]). Here, 
we propose the following specific notion. 

Definition 16: Let |X| = {X, a, /3) be a non-empty CF-order. The order \X\ 
is 0- contractible if it is composed of a single point. A point x is an-simple for 
|A| if |o;°(a;)| is {n— l)-contractible, n > 1. We denote X^n the set composed of 
all points of X which are not o;„-simple for \X\. The order |A| is n- contractible, 
n > 0, if there is a sequence A°, ...,X^, with A° = X, X^ = if i is odd, 
A* = if i is even, for i = 1, ..., k, and such that X^ = {a}, a £ X. 





Fig. 4. Illustration of aa-simplicity 



The notion of oi-simple (resp. a 2 -simple, os-simple) point is appropriate 
for (resp. H^, H^), see [4]. For n sufficiently large, there is the possibility 
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that some patterns like the dunce hat may appear in the adherence of a point, 
a„-simple points cannot handle these cases. Nevertheless o;„-simple points are 
sufficient for a lot of cases and they have some interesting features: 

- The checking of an a„-simple point can be made by using polynomial algo- 
rithms; 

- From this notion, thinning algorithms may be easily designed (see [13]). We 
may thin an object by removing alternatively a„ and /3„-simple points; 

- o;„-simple points allow to have powerful parallel algorithms. The problem of 
removing in parallel simple points is a difficult problem in Z^. 

Fig. 4. gives an example where we test if a point x is os-simple or not for a 
subset S of H^, X is the central element. We have used the array representation 
with the same conventions as for H^. In Fig. 4 (a) black dots represent a°(cc)nS'. 
In Fig. 4 (b), a 2 -simple points of S have been eliminated, then /32-simple points 
(Fig. 4 (c)), and a 2 -simple points (Fig. 4 (d)): x is os-simple for S. Note that 
the same Fig. 4 may also be considered as an illustration of thinning an object 
S, with S = a°{x). 
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Abstract. In the framework known as digital topology, two different 
adjacency relations are used for structuring the discrete space . In 
this paper, we propose a model for digital topology based on the notion 
of order and discrete topology. We “validate” our model by considering 
the two fundamental notions of surface and simple point. At last, we 
give the different possible configurations that may appear in 2- and 3- 
dimensional surfaces in Z'^ which correspond to our model. 

Keywords: discrete topology, digital topology, surface, simple point 



1 Introduction 

Perhaps the simplest way for structuring the discrete space Z” is to consider 
two graphs, these graphs correspond to two adjacency relations between the el- 
ements of Z": the direct and the indirect adjacency (for example, the 4- and 
the 8-adjacency in Z^). This approach, known as digital topology [22,12], is 
widely used in Z^ for the applications to image analysis. Nevertheless, severe 
limitations appear in the discrete space Z^. For example, in the pioneering work 
of Morgenthaler and Rosenfeld [19], a definition of surfaces in Z^ is proposed, 
using such adjacency relations. This definition is not based on structural prop- 
erties which are fundamental for surfaces (e.g. the neighborhood of each point 
should constitute a “cycle” [8]). Hence, it is difficult to appreciate the relevancy 
of this definition. Furthermore, this definition seems to be almost impossible to 
generalize to higher dimensions. 

In this paper, we consider an approach based on the notion of order and 
discrete topology [1,10,14,16]. We propose a model for digital topology by asso- 
ciating, to each subset of Z”, two orders: the hit (resp. miss) order corresponds 
to the indirect (resp. direct) adjacency. 

We “validate” our model by considering the two fundamental notions of 
surface and simple point. 

We show that the notion of surface in these two orders corresponds exactly to 
the notion of surface introduced by Morgenthaler and Rosenfeld. We also verify 
that the notion of simple point introduced in these two orders agrees with the 
corresponding notion in digital topology. 

These results are remarkable, since the definitions of surface and simple point 
in the digital topology framework are fundamentaly different from ours. These 
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differences explain why we used a computer in order to prove these results by 
an exhaustive checking. A hand-made proof would be of low interest. 

At last, we give the different possible configurations that may appear in 2 - 
and 3-dimensional surfaces in which correspond to our model. 

2 Basic notions 

In this section, we introduce some basic notions relative to orders (see also [ 6 ]). 

If A is a set, 7^ (A) denotes the set composed of all subsets of A, if S' is a 
subset of A, S denotes the complement of S in A. If S is a subset of T, we 
write S C T, the notation S C T means S C T and S yf T. If 7 is a map 
from 7^(A) to 7^(A), the dual of 7 is the map *7 from 'P(A) to "P(A) such 
that, for each S C A, *j{S) = 7 (S). Let d be a binary relation on A, i.e., a 
subset of A X A. We also denote S the map from A to V{X) such that, for 
each X of A, 6{x) = {y G X, (x,y) G (5}. We define <5° as the binary relation 

= 5 \ {(x, x); X G A}. 

An order is a pair |A| = (A, a) where A is a set and a is a reflexive, anti- 
symmetric, and transitive binary relation on A. An element of A is also called 
a point. The set a(x) is called the a-adherence of x, if y G a(x) we say that y is 
a-adherent to x. 

Let (A, a) be an order. We denote a the map from V{X) to 7^(A) such that, 
for each subset S of A, a(S') = U{a(x); x G S'}, a(S) is called the a-closure of 
S, *a{S) is called the a-interior of S. A subset S of A is a-closed if S = a(S), 
S is a-open if S = *a(S). 

Let (A, a) be an order. We denote f3 the relation (3 = {(x,y); (y,x) G a}, (3 
is the inverse of the relation a. We denote 9 = a\J (3. The dual of the order 
(A, a) is the order {X,(3). 

Note that *a(S) = {x G S; /3(x) C Sj, and *(3{S) = {x G S; a(x) C S|. 

The set Oa composed of all a-open subsets of A satisfies the conditions for 
the family of open subsets of a topology, the same result holds for the set Op 
composed of all /3-open subsets of A; we denote respectively = (A, Oa) and 
Tp = (A, Op) these two topologies. These topologies are Alexandroff topologies, 
i.e., topologies such that every intersection of open sets is open [ 1 ]. 

An order (A, a) is countable if A is countable, it is locally finite if, for each 
X G A, 6{x) is a finite set. A CF-order is a countable locally finite order. 

Let (A, a) be a CF-order. Let xq and Xk be two points of A. A path from xq 
to Xfe is a sequence Xo,Xi, ...,x^ of elements of A such that x^ G 6 *(xi_i), with 
i = 1 , ..., k. 

It may be seen that a CF-order (A, a) is connected for Ta (or for Tp) if and 
only if it is path-connected, i.e, if for all x, y in A, there is a path from x to y. 

When (A, a) and (A', a') are orders, a map / from A to X' is order pre- 
serving, or isotone, if y G a(x) implies f{y) G a'[/(x)]. We say that / is an 
isomorphism from (A, a) to (A', a'), if / is a bijection so that y G a{y) if and 
only if f{y) G a'[/(x)], for all x, y in A. 
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If (X,a) is an order and S' is a subset of X, the order relative to S is the 
order |S| = (S, a n (S x S)). 

Let (X,a) be an order. An element x such that a°{x) = 0 is said to be 
a-terminal (for X). A point y is an a-terminal of x if y is an a-terminal and 
y e a{x). 

If {X, a) is an order, we define a* as the relation on X such that y G a* {x) if and 
only if y G a°(a;) and a° {x) (3° (y) = 0. The set a*{x) is called the a-closeness 

of X, if y G a*{x) we say that y is a-close to x. 

Let (X,a) be a CF-order and let a: be a point of X. We say that a; is a- 
unipolar if a*{x) consists in exactly one point. We say that x is a- free if there is 
a sequence xq, Xk with Xk = x, such that xq is a-unipolar and Xi is a-unipolar 
for the order \X \ {xq, ■■■, Xi-i\\, i = 1 , ..., k. A point which is not a-free is called 
an a-link. The a-kernel of X is the subset of X composed of all a-links of X. 
Two orders (X,a) and {X,a') are said to be a-equivalent if the orders induced 
by their a-kernel are isomorphic. 

We use a general definition for n-dimensional surfaces which has been pro- 
posed by Evako, Kopperman and Mukhin [7]. This notion is close to the notion 
of manifold used by Kovalevsky [17]; nevertheless it does not involve the neces- 
sity to attach a notion of dimension to each element of X, which allows to have 
a simpler definition (in particular, no use of isomorphism is made) . 

Definition 1: Let \X\ = (X,a) be a non-empty CF-order. 

- The order |A| is a 0-surface if X is composed exactly of two points x and y 
such that y ^ a{x) and x ^ a{y). 

- The order |A| is an n-surface, n > 0, if |A| is connected and if, for each x in 
X, the order |0°(x)| is an {n— l)-surface. 

- A (closed) curve is a Tsurface, a (closed) surface is a 2-surface. 

We will use the following notion of simple point as a model for the classical 
corresponding notion in Z'^ and : 

Definition 2: Let \X\ = (X,a) be a non-empty CF-order. 

- The order |A| is 0-contractible if X is composed of a single point. 

- A point X is a„-simple if |a°(x)| is (n — 1)- contractible, n > 0. 

We denote X^n the set composed of all points of X which are not an-simple 
points for |A| . 

- The order |A| is n-contractible, n > 0, if there is a sequence . . . , X^ with 

A° = X and X^ = {a}, a G X, such that A* = ifi is odd, and A* = A^“^ 

if i is even, for i = 1, . . . ,k. 

3 Orders associated to 2 ^ 

We give now a presentation of some orders which may be associated to Z". 

Let if be a set and let if” be the Cartesian product of n copies of E. An 
element a of if" may be seen as a map from {!,... ,n} to if, a{i) is the i-th 
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coordinate oi a, i = 1, n. If S' is a subset of if”, the i-th projection of S is the 
set S{i) = {a(z); a G S}, i = 1, n. 

Let Z be the set of relative integers. We consider the families of sets Hq, H\, 
such that, ifg = {{a}; a G Z}, Hi = {{a, a + 1}; a G Z}, U Hi. 

A subset S of Z" which is the Cartesian product of exactly m elements of Hi 
and (n — m) elements of iLg is called a m-cube of Z”. We denote iL” the set 
composed of all m-cubes of Z", m = 0, ..., n. 

An m-cube of Z" is called a singleton if m = 0, a unit interval if to = 1, a unit 
square if to = 2, a unit cube if to = 3. 

In this paper, the basic order associated to Z” is the order (H'^,a), where 
a =Z, thus y G a{x) ii x Z y. In Fig. 1, an example of a subset S of H^ is 
given. The object S is made of two connected components Ai and 82 . It may be 
seen that contains one a-terminal and four a-free elements (three a-unipolar 
elements and one element which is an a-unipolar element after the removal of 
these three elements). It follows that the a-kernel of Si is composed of a single 
element: |S'i| is a-equivalent to an isolated point. In a similar way |S' 2 | is a- 
equivalent to an open curve made of three elements. 




Fig. l.A subset S of (a), and its dual in (b). 



In the framework of 2D-image analysis, an element of an object is called 
a pixel. In a square grid, a pixel is sometimes seen as an element of Z^ and 
sometimes as an elementary square of TZ^. In fact, there is an equivalence between 
the two approaches. This comes from the fact that there is a “geometrical” 
duality in a square grid: if we consider the four elementary squares which contain 
an element of Z^, these four squares may be considered in turn as “constituting 
a unit square” . Note that we do not have such a duality in an hexagonal grid. 
In order to make more explicit this kind of duality, we propose the following 
construction. 
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We define the dual of Z as the set = {p+ 1/2; p e Zj. The dual of Z^ is 
the set = [*Z]'^. We consider the families of sets *Hq, *H\, *H^ such that, 
*Hq = {{a}; a G *Z}, *H\ = {{a, a + 1}; a G *Z}, = *Hq U 

A subset S of which is the Cartesian product of exactly m elements of *Hl 
and (n — m) elements of *Hq is called a m-cube of we denote the set 
composed of all m-cubes of m = 0, n. 

We denote /, the map from to *H", such that, for each x of i7", if x{i) = 
{p,p+ 1}, then [/(x)](z) = {p + 1/2}, and if x{i) = {p}, then [f{x)]{i) = 
{p — 1/2, p+ 1/2}, i = l,...,n. The dual of a m-cube of H'^ is the image under 
/ of this TO-cube, it may be seen that the image of a m-cube is a (n — m)-cube. 
The dual of a subset S of i?” is the subset of composed of the duals of 
all m-cubes in S. It may be seen that the map / is an isomorphism between 
the two orders (i?", D) and C). In Fig. 1, the dual of a subset S of is 

represented. 

To conclude this section, we introduce some adjacency relations among ele- 
ments of Z’^ which may be naturally derived from the order iJ". 

Two elements p and q of Z” are said to be m-adjacent if there exists a m-cube 
of iJ" which contains both x and y. Two elements of Z” are strictly 0-adjacent 
if they are 0-adjacent (i.e. if they are equal), they are strictly m-adjacent if they 
are m-adjacent and not (m — l)-adjacent, with m = 1, ...,n. 

4 The i/d adjacency approach 

In this section we give formal notions which correspond to the graph-theoretic 
approach introduced by Rosenfeld, this approach is based on two adjacencies 
relations, the i- and d-adjacencies. 

Two elements p and q of Z” are said to be indirectly adjacent or i-adjacent 
(resp. directly adjacent or d-adjacent) if they are n-adjacent (resp. 1-adjacent). 
It is well known that, in order to avoid connectivity paradoxes, we never use the 
same adjacency on both S and S. If we use the t-adjacency (resp. d-adjacency) 
for S, we must use the d-adjacency (resp. z-adjacency) for S. In the sequel the 
a-adjacency indicates the z- or the d-adjacency, i.e., we have a = i or a = d. 

Let p be an element of Z”. We denote To(p) the set composed of all elements 
of Z" which are o-adjacent to p. The a-neighborhood of p is the set r/{p) = 
Faip) \ {p}. An element q of r/{p) is called a a-neighbor of p. 

For example, each p G Z“^ has 8 z-neighbors and 4 d-neighbors, each p G Z^ has 
26 z-neighbors and 6 d-neighbors. 

An a-path is a sequence po, ...,pk, Pi being a-adjacent to Pi-i, z = 1, k. 

A subset S of Z" is a-connected if for all p, q of S there is an a-path from p to 
q which is composed solely of elements of S. 

A subset S of Z” is a simple closed a-curve if S is a-connected and if the a- 
neighborhood of each element of S is made of exactly two elements. 

Let us consider the digital plane Z^ . This set must be interpreted as a (non- 
bounded) surface. Thus it might be desirable that the neighborhood of each 
point of Z^ could be interpreted by something like a simple closed curve. We 
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see that the d-neighborhood of a point is made of four elements, and that any 
couple of these elements are not d-adjacent. In this sense the d-neighborhood of 
a point is “under-connected” . In a similar way, we see that the z-neighborhood of 
a point is “over-connected” . These peculiarities explain many of the difficulties 
encountered when using the i/d adjacency approach. Since the basic notion 
of a neighborhood is not appropriate, only ad-hoc definitions are possible. For 
example, the notion of surface in is far from being clear. Morgenthaler and 
Rosenfeld (MR) proposed the following definition (see [19], [20], [21], [11]). 

Let (a,d) = (i,d) or (d,i). 

A MR a-surface is a finite connected subset S of such that, for each p G S, 
we have: 

i) [r*{p) n S] has exactly one a-component a-adjacent to p; and 

ii) [r*{p) n S'] has exactly two d-components which are d-adjacent to {p}, we 
denote C and D these components; and 

iii) Vq G [r*{p) n S], q is d-adjacent to both C and D. 

We conclude this section by giving a characterization of simple points in 
([13,5]). 

Let S C Z^ and p G Z^. Let (a,d) = (z,d) or (d, i). 

We denote / 2 (p) = {<7 G Z^, q is strictly 2-adjacent to p}. 

The geodesic a-neighhorhood Gf (p) of p inside S is defined by: 

Gf (p) = T*(p) n s 

Gf (p) = {r*{p) n s) u {g G T 2 (p) n S; card(T;(p) n r*{q) n s) = 2} 

A point p G Z^ is a-simple (for S) if and only if the number of a-components in 
G^{p) and the number of d-components in G§{p) both equal one. 

5 The h/m order approach 

In order to build consistent topological notions for a subset S of Z”, we associate 
to S a subset 'F(S) of iL”; thus we recover the structure of a (discrete) topolog- 
ical space by considering the order (id”,Z). In this paper, the transformation 
W is chosen in such a way that the induced topological notions may be seen as 
a “model” for the notions derived from the i/d adjacency framework. A natural 
idea for defining W is to consider “hit or miss” transformations [23]. Thus we 
consider the set composed of all elements of iL" which have a non-empty 
intersection with S. In a dual way, we consider the set S'™ composed of all ele- 
ments of iL” which are included in S. Our model for the i/d adjacency approach 
consists in considering the a-kernels of the two sets S^ and S™ as counterparts 
for the i- and d-adjacencies. 

Definition 3: Let S C Z”. We denote S^ = {x G id"; a; n S yf 0}, and 
S™ = {a; G iL”; a; C S}. The h-set associated to S is the set iF^(S) which is the 
a-kernel of S^. The m-set associated to S is the set <F™(S) which is the a-kernel 
of S'^ . The d-order relative to S is the order {T^{S),a), the m-order relative to 
S is the order ('?'™(S),a), with a =Z. 
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Fig. 2 .(a): a subset S of (the seven black dots) and the set (black elements); 
(b): the /i-order relative to S. 

We note that: 

- ^ 

- The set is a-open, the set S"" is a-closed; 

- We have = S'™; 

- The set 'f'^(S) is, in general, neither a-open nor a-closed. 

The proofs of the two following properties will be given in a forthcoming 
paper: 

Property 4: Let S C Z". The setT^{S) is composed of all elements {p},p € S, 
and all m-cubes of Z” which contain two strictly m-adjacent points q and q' of 
S, m = 1, ..., n. 

Property 5: Let S C Z”. The dual of T^{S) is composed of the duals of all 
elements {p},p € S, and all the m-cubes of which are the intersection of the 
dual of {g} and the dual of {q'} for some q, q' G S. 

In Fig. 2 (a), a set S^ is represented. The original set S is represented by the 
seven black dots. We use the following conventions: a singleton is depicted by 
a circle ( o ), a unit interval by a rectangle (a), and a unit square by a square 
(□). Two elements a, b are linked by a straight line if a € a* (6) or & S a* (a). 
In Fig. 2 (b), 'f'^(S), the a-kernel of the set S^, is given. 

It may be easily seen that: 

Property 6: Let S C Z”. 

The set S is a closed curve for the h-order (resp. m-order) if and only if S is a 
closed i-curve (resp. closed d-curve). 

The following property has been proven by an exhaustive checking with the 
help of a computer: 

Property 7: Let S C Z^, let x be a point of S. 

The point x is i-simple if and only if {x} is a^-simple for the h-order. The point 
X is d-simple if and only if {x} is a^-simple for the m-order. 
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6 The h-surfaces and m-surfaces in 

Let S <Z Z^, vfe say that S is an h-surface (resp. an m-surface) if S' is a closed 
surface for the /i-order (resp. m-order). 

In this section, we show that h-surfaces (resp. m-surfaces) are equivalent to 
the Morgenthaler and Rosenfeld (MR) f-surfaces (resp. d-surfaces) [19,20,21]. 
By construction, the MR’s definition cannot be extended in higher dimensional 
spaces. Furthermore, it may be asked whether it is possible to recover the fon- 
damental structure of a combinatorial 2D-manifold ([8]) where each element is 
“surrounded by a cycle” . In fact an answer to this question has been given by 
Ayala and Al. (see [2], [3], [4]) in an original framework which provides a link be- 
tween digital spaces and Euclidean spaces. A digital object is defined as a digital 
manifold if its continuous analogue is a combinatorial manifold. In this frame- 
work, it has been shown that Morgenthaler’s surfaces are combinatorial surfaces. 
In a previous work, Kong and Roscoe [11] had proposed a structural necessary 
and sufficient condition based on specific point configurations, that characterizes 
the MR surfaces. Their approach is also based on continuous analogs. 

On the opposite, our approach is purely discrete. By drawing the inferences of 
the definition of a 2-surface, we recover very simply the local point configurations 
that may appear in h- or m-surfaces. We can even exhibit in the next section, 
the different configurations that may appear in /i-surfaces in as well as in 
3-surfaces for h-orders in Z^. 

In the subsequent figures, we use the same conventions as for the two- 
dimensional case, in addition, a unit cube is represented by a cube (QJ). 

The following property is a direct consequence of Def. 1: 

Property 8: If S C Z^ is an m-surface, then each (3-terminal element x of 
is a unit square, and 0°{x) is a closed curve composed of four singletons 
and four unit intervals (see bi. Fig 3). 

Property 9: A subset S of Z^ is a MR d-surface if and only if S is an m- 
surface. 

In the sequel, we will use “utrs” as an abbreviation for “up to rotations and 
symmetries” . 






Fig. 3. The 3 possible configurations for the set 9° (x) of a /3-terminal x in an h-surface. 
The /3-terminal x is the front central unit square in &i, the central unit cube in 62 and 

& 3 . 
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Property 10: If S C is an h-surface, then 

i) for each (3-terminal element x of'I'^{S), corresponds (utrs) to one of 

the 3 configurations bi (i = 1 . . .H) depicted in Fig. 3, and 

a) for each a-terminal element x of'F^{S), 0'^{x) corresponds (utrs) to one of 
the 13 configurations Oi (i = 1 . . .lH) depicted in Fig. 

The proof of this property is easily obtained by enumerating the different 
possible cases. 

It may be seen that each of the configurations corresponds (utrs) to one 
of the configurations that may appear in a MR i-surface (see for example [3], 
Fig. 3). We have indeed: 

Property 11: A subset S of is a MR i-surface if and only if S is an h- 
surface. 





Fig. 4. The 13 possible configurations for the set 0°{x) of an a-terminal x in an h- 
surface. The a-terminal x is the central singleton. 
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(b) 



Fig. 5. (a): a configuration in which y satisfies the MR’s definition, but which may not 
appear in a MR i-surface; (b); the set 0°{y) for the h-order. 



Remark: let us consider the configuration (a) in Fig. 5. We remark that y 
is an i-surface point, according to MR’s definition. But it is not possible that 
X satisfies the conditions of the MR’s definition, whatever neighborhood it may 
have; in other words, the configuration (a) is not extensible (see [18] for a precise 
definition of this notion). 

With Def. 1, the configuration (a) is directly rejected from the list of con- 
figurations that may appear in an /i-surface: 0°{y) is not a closed curve (see 
Fig. 5 (b)). 



7 The 2-surfaces and 3-surfaces for h-orders in 

In the Fig. 6 , 7, the set 9{x), x being a 4-cube of is represented by three 
diagrams of 27 elements, si,S 2 ,S 3 (from left to right). For each of these three 
diagrams, we use the same conventions as for the figures of the previous section, 
concerning the inclusion relation of the order Z). In addition, let us consider 
the two vectors a and — a in Fig. 6 : each element a: of si is included in x' , the 
translated of cc by a, and each element y of S 3 is included in y' , the translated of y 
by — a . For example in Fig. 6 ( 04 ), we have: x C y,x C z,x C x' , z C z' ,x' C z' . 

Property 12: If S C is an h-surface, then for each (3-terminal element x 
ofIf^{S), 9'^{x) corresponds (utrs) to one of the 6 configurations Ci (7=1.. .&): 
ci,C 2 ,C 3 correspond respectively to 61 , 62,^3 depicted in Fig. 3, and 04,05 , cq are 
depicted in Fig. 6. 

Property 13: If S C is a 3- surface for the h-order, then for each (3-terminal 
element x of'F^{S), 0'^{x) corresponds (utrs) to one of the 5 configurations di 
(i = 1 . . . depicted in Fig. 1. 
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Abstract. This article presents an algorithm computing a border map 
of an image that generalizes to the n dimension graph structures used in 
image analysis. Such a map represents simple and multiple adjacencies, 
inclusion of regions, as well as the frontier type between two adjacent 
regions. An algorithm computing a border map, linear to the number of 
elements of an image, is defined in 2D, then generalized in 3D and in nD. 

Keywords: image modeling, adjacency graph, topology, combinatorial 
maps, generalized maps. 



1 Introduction 

Before analyzing effectively the contents of an image and starting a process of 
recognition or data analysis, it is necessary to extract the valuable informa- 
tion and to structure it. This first stage of all analysis process is called image 
segmentation. Several approaches exist, here we are interested in regions seg- 
mentation which segments the image into connected component of pixels. Once 
this partition obtained, it is necessary to structure this information so that more 
accurate or specialized processes can be applied. Our aim here is to propose a 
graph like structure in order, not only to allow more sophisticated analysis of 
the image, but also to allow a modeling and thus facilitating the visualization 
and the interactive manipulation of the segmentation result. 

A commonly used structure is the region adjacency graph [Ros74]. But such a 
graph does not indicate the type of adjacency between regions: it does not specify 
neither the possible relations of inclusion between regions, nor if two regions 
have one or several common borders. Kropatsch and Macho proposed in [KM95] 
a dual planar multi-graphs structure allowing to code all these informations. The 
inclusion is coded with the help of a fictive edge that appears as a loop in the dual 
graph which allows to differentiate it from classical adjacency edges. Operations 
are thus homogeneous but require to maintain two graphs. Brun [Bru96] uses 
planar maps to which he adds some fictive edges to code the inclusion. But the 
resulting structure is no more homogeneous. 

In fact to code information about multiple adjacencies and inclusion leads 
to realize the topology of the image. Fiorio [Fio96] has introduced the notion 
of topological graph of frontiers {GTF for short). The interest of a GTF is not 
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only to define a topological structure richer in information than the classical 
structures, but also computable in linear time to the number of pixels of the 
image, with only two consecutive lines of the image in memory. The immediate 
problem for this optimal algorithm is its extension to 3D images, and nD images. 
The purpose of this paper is to propose one solution to this problem. The solution 
requires the use of the resulting structure of a regions segmentation, and related 
algorithms of topology-based modeling: combinatorial maps [Jac70,Cor75] and 
generalized maps [Lie89,Lie91]. 

We begin. Section 2, by introducing in an informal manner concepts of topol- 
ogy and combinatorial maps. Section 3 gives a short presentation of the GTF, 
then Section 4 proposes a simplification of this structure, called border map. Sec- 
tion 5 indicates how the algorithm which computes such a structure is extended 
to the 3D and then to the nD case. Section 6 summarizes up the main features 
of the algorithm presented, and Section 7 gives some examples of applications. 
Finally Section 8 concludes this paper. 



2 Topology and combinatorial maps 

Defining a coherent topology for images that are discrete spaces is not a trivial 
problem. Some simple and efficient solutions exist for the binary case (cf [KR89]) 
but they are not translatable for grey levels or color images, the case that inter- 
ests us here. In fact it appears that to define a topology for spaces such as images, 
it is necessary to introduce and to define a notion of border. Kovalevsky [Kov89], 
first, proposes to no more consider an image as only a set of pixels and intro- 
duces border elements (edges and vertices) of pixels as elements of the image. 
Khalimsky, Kopperman, and Meyer present in [KKM90] a solution of the same 
type where the image is defined like a cartesian product of discrete straight 
lines. Based on these works [Fio95] proposes an interpixel approach for image 
analysis where he also takes into account border elements (see Figure 1). He 




Fig. 1. Interpixel elements for a 2D and 3D image. 



defines a topology well adapted for image analysis: the star-topology [AAF95]. 
This topology and the one proposed by Khalimsky, Kopperman and Meyer are 
combinatorial topologies leaning on the notion of space subdivisions: the space 
is partioned into elements and border elements and provides incidence and ad- 
jacency relations linked to this notion of border. Thus IR^, for example, will be 
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divided into volumes (basic elements), surfaces (border of volumes), curves (bor- 
der of surfaces) and vertices (border of curves). A vertex is said to be incident 
to a curve if it is a border of the curve. A surface is incident to a volume if it 
is a part of its border. Two volumes are adjacent if they are incident to a same 
surface. 

Generalized maps [Lie97] are a combinatorial structure allowing to code this 
subdivision, as well as the incidence relations between elements of the space, 
thus realizing the topology. “Implementation of these structures (and related 
algorithms) can he done without loss of information and without loss of proper- 
ties^’’ . In fact maps or generalized maps [Lie89,Lie91] generalize the combinatorial 
map notion, which is the first mathematical model for topology-based represen- 
tation [Jac70,Cor75]. 

Generalized maps are therefore concerned with modeling subdivision of ge- 
ometric spaces. For example the object in Figure 2. a is modeled by the G-map 
of Figure 2.c. This last is obtained by successive decompositions of the object, 
starting by distinguishing the faces (the triangle and the rectangle), then the 
edges and at last vertices incidents to edges. Finally we get 14 half-edges called 
darts. An involution, noted oq and illustrated on faces by a little stroke, asso- 
ciates to a dart the dart of the adjacent vertex of the same edge and of the same 
face. Another involution, noted «i and illustrated by the gray arc, associates to 
a dart the dart of the adjacent edge incident to the same vertex and belonging to 
the same face. Finally an involution, noted 02 and illustrated by a thick stroke, 
associates to a dart the dart of the adjacent face of the same edge and incident to 
the same vertex. In short, involution oq puts into correspondence vertices (cells 
of dimension 0) of the object, involution ai link up edges (cells of dimension 1) 
and involution «2 joins two faces adjacent (cells of dimension 2). 





Fig. 2. From a 2D object to a 2- 
G-map 




Fig. 3. 3-G-map modeling two adjacent 
cubes 



In general, an involution ai associates to a dart of a i — cell c the dart of the 
t-cell adjacent to c. 

G-map of dimension 2, or 2-G-map (in the following, we writes k-x instead 
of X to fc dimension) realizing the topology of the object of Figure 2. a is the 
algebra (B, ag, ai, a 2 ), where B is the set of the 14 darts, resulting from the 
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decomposition, and verifying the fact that «o ° is also an involution. The 
links between darts are called sewing. 

Extending of G-maps to the third dimension is immediate: it consists in 
sewing 2-G-maps by an additional involution, noted 0:3. Figure 3 represents two 
cubes sewn along a face (involution 0:3 is represented by the thick stroke). Defi- 
nition of n dimensional G-maps is done in the same way using n -I- 1 involutions 
OO) • ■ • ) A G-map may or not represent some orientable subdivisions [Lie89]. 
Intuitively, it is sufficient to cross two 2-sewings in order to get the topology of 
a Moebius ring. 

An orientable subdivision represented by a G-map can be done by a map 
with two times less darts that a G-map. Thus, the 2-G-map of Figure 2.b is 
translated into the 2-map of Figure 4 by only keeping one dart of 2. Such a 
2-map is an algebra {B' , /3i, P 2 ), with a permutation /3i = ag o a\ (grey arcs on 
Figure 4) and one involution P 2 = ciq o a 2 (diagonal stroke Figure 4), where B' 
are the thick darts on Figure 4. 






a. 1-cells b. 2-cells c. 3-cells 
Fig. 4. 2-map Figure 2.b Fig. 5. Cells of 2-G-map of Figure 2.c 



2-G-maps allow to describe all types of subdivisions of 2-manifolds, and in 
particular subdivisions of 2D images in regions. They are characterized by the 
fact that the set of darts can be partitioned into sets corresponding to vertices, 
edges, or again to faces, whereas this is not the case for darts of 2-maps. To get 
these sets, it is sufficient to remove respectively involution oq ((1) of Figure 2.c) 
ai ((2) of Figure 2.c), 0:2 ((3) of Figure 2.c). This makes easier the writing of 
algorithms which need to operate on each of such types of cells. This is why, in 
all the following, algorithms are described in terms of G-maps and not maps. 

3 Topological Graph of Frontiers 

A GTF (see definition in [Fio95,Fio96]) corresponding to an image without 
inclusion of region is precisely a 2-map whose vertices represent regions and 
whose edges represent frontiers between regions. Such maps are the duals of those 
presented in Section 2: permutation on darts describe vertices instead of faces. 
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Fig. 6. A Topological Graph of Frontiers 



The permutation around a vertex return all the adjacent regions to the region 
represented by this vertex. A region adjacent to r enclosed regions is represented 
by a vertex where all incident darts belong to not only one permutation, but to 
r + 1 permutations. Figure 6 [Fio96], shows two permutations (grey arrows) are 
associated to region R 4 , one corresponding to the external contour of R 4 , and 
the other to the internal contour. 



li ~ 13 _ 




H LHBHE 


' yd 



Fig. 7. Precodes 



In order to extract a GTF from an image, every pixel is assumed to be asso- 
ciated to the region it belongs to. This association is the result of a segmentation 
and a region labeling, linear in the number of pixels of the image [FG96]. The 
extraction algorithm [Fio96] is a greedy algorithm: it scan the image line by line, 
starting with a GTF reduced to one vertex corresponding to the outside region 
of the image, noted Rq in Figure 6. The scan is done with a 2 pixels window and 
updates the GTF under construction according to 12 possible configurations, 
called precodes, shown in Figure 7. The algorithm manages an auxiliary list of 
darts that allows to sew the current darts of one line to darts of the previous 
line. It is very efficient, since it treats an image of 512 x 512 in about 0, 5 seconds 
on a workstation of entrance range. 
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However, this algorithm presents some drawbacks making it difficult to trans- 
late to nD: 

— Some retroactive corrections are necessary because, given two distinct con- 
tours currently processed, we cannot know if they will merge or not (see 
Figure 8). Corollarly, the GTF under construction does not necessarily cor- 
respond to the part of the image already scanned. The correspondence be- 
comes correct only at the end of the process. 

— An auxiliary list of darts and precodes is used in order to scan only once the 
image and in order to achieve the optimal complexity. 

— The inclusion tree obtained is implicit. 



current precode 




current precode 



Fig. 8. contours merging 



The representation that we propose in this paper as well as the related algo- 
rithms become free of these drawbacks so that they facilitate the extension to 
nD and allow to code explicitly the inclusion tree. Embedding informations are 
also included in our representation in order to be able to allow the display. 

4 Border map of regions 

4.1 First algorithm 

In order to get rid from drawbacks due to the algorithmic optimizations, let us 
reconsider the problem and forget momentarily questions of performances. We 
shall reintroduce them progressively. 



Algorithm 1: Naive algorithm 



Input: Image I of ni x H2 pixels 

Output: 2-G-map of whose faces represent regions and edges borders between 
regions. 

1 Build a 2-G-map G of ni x ri 2 squares sewn between them; 

2 Merge all adjacent faces couples of a same region; 

3 Merge all couples of edges of same direction incident to a vertex of degree 2; 
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Algorithm 1 creates the 2-G-map whose faces represent regions and edges borders 
between regions. 



(a) (b) (c) 

Fig. 9. Faces Merging 

The merging of two faces along an edge (Figure 9. a) is done by first ai~ 
unsewing the edge from its four incident darts (Figure 9.b), then by resewing by 
ai the four darts of the two remaining face pieces become 1-free (Figure 9.c). 

The steps described above do not manage the case of regions enclosed in 
others. A simple manner to describe implicitly the inclusion tree of regions is to 
indicate for every dart whether it belongs to an internal or an external contour 
of a region. From now on we will say respectively internal or external dart for a 
dart belonging to respectively external or internal contour. One couple of darts 
2-sewn in which one is internal and the other one is external corresponds to an 
edge of the inclusion tree. The informal definition of an internal or external dart 
is the following: 

— initially, all darts are considered internal, 

— the first dart met for a given region is external, 

— a dart belonging to a contour with at least one external dart is external. 

The space complexity of this algorithm is 0(nin2)^. Since for every dart, the 
face it belongs to is scanned in order to determine if it is external or not, the time 
complexity of the algorithm is 0 (nin 2 m), where m is the mean number of darts 
by face. These high complexities require optimization of the naive algorithm in 
time and in space. 

4.2 Algorithm 1 optimizations: the border map 

A first optimization of our algorithm consists to start with a G-map including 
only one square for the first pixel of the image and then, for every new created 
square immediately try to merge it with the current G-map. The first two steps 
of the naive algorithm are thus grouped in one. Figure 10 shows the current pixel 
in gray (Figure 10. a), that has been merged with the part of G-map containing 
a pixel on the previous line belonging to the same region, but not with the 
previous pixel on the same line since it is white and does not belong to the 
same region. The processing of a pixel consists therefore to apply, depending 
on the horizontally and vertically adjacent regions, zero, one or two times, the 
illustrated merge operation of Figure 10. This process requires to know the darts 

^ Reminder: the processed image contains rii x ri 2 pixels. 
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pixel 















(a) (b) 

Fig. 10. Current pixel merging 



of the image already processed with which the pixel must be sewn. For that, one 
dart of the pixel previously processed pixel (vertical dart in black on Figure lO.b) 
is memorized, as well as a dart in the beginning of the current line (horizontal 
dart in black on Figure 10. b), in the case of the current pixel being the first 
of the line. These darts belong to the set of darts represented in black to the 
figure 10. a that closes the G-map . It corresponds to the auxiliary list of darts 
of [Fio96]. 

The complexity in space is proportional to the number of darts of the G-map, 
which is about ni or ri 2 for images with a weak number of regions, and about 
niU 2 for images having a high number of regions. In practice, the number of 
darts is weaker than mn 2 (about 6 to 10 times less darts). 

A second optimization consists in not creating the current pixel explicitly, 
but foreseing all possible configurations. These are only 4, due to the fact that 
the current pixel (i,j) can or cannot be merged either with the pixel (i — l,j), 
or with the pixel (z, j — 1). We obtain thus an algorithm with four precodes (see 
Figure 11). 




Fig. 11. 4 precodes 



Figure 12 illustrates for each of the four cases of Figure 11 the corresponding 
operations to do on the G-map (Figure 12 .e). In the case (a), no dart is modified, 
only the coordinates of the vertex are changed, while the creation of a full pixel 
and the application of two merging operations have created and destroyed eight 
darts. For the case (d), eight darts are effectively created, and four are 2-sewn. 
For cases (b) and (c), identical up to the orientation, four darts are created 
and two are 2-sewn. The theoretical complexity in time does not change, but in 
practice, the time is highly reduced because, contrarily to the previous algorithm, 
that darts that are near be destroyed by merging are not created. 

A third optimization consists in grouping steps based on 4 precodes and steps 
of edges merging. It requires then to consider the 12 studied cases by [Fio96]. 
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(e) 




(b) 




(c) (d) 



Fig. 12. Operations on the G-maps related to the 4 precodes 



We can at the same time examine 3 additionnal precodes, corresponding to 
cases where regions are not 2-manifolds, these cases are examined exhaustively 
in [Pen97]. 

4.3 Inclusion tree 

The algorithm of internal and external dart computation can be improved by 
indicating that all darts of a contour are external as soon as a first external dart 
has been met. But so, the inclusion tree remains implicit. 




Fig. 13. Example of inclusion tree 



In order to construct effectively an inclusion tree (an example of an inclusion 
tree is shown Figure 13), it is necessary to scan again the G-map connected 
component by connected component (see Algorithm 2). Algorithm 2 starts by 
assigning the outside region (line 1) (so-called infinite region) as root of the tree. 
Then it calls the AddRegionsFilles recursive procedure (line 3) which marks 
recursively the regions enclosed in others and so can add the corresponding 
vertex to the inclusion tree. This procedure starts with the first dart of the map 
(line 1). This dart is necessarily a dart of an internal contour of the region R. 
By applying a 2 (line 2) the dart of an external contour of an enclosed region in 
R is recovered. From this dart the connected component it belongs to is scanned 
by applying in any order any number of times involutions Ooj (line 3). All 
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Algorithm 2: Creation of the inclusion tree 



Input: Image I with a connected component labeling; 

The G-map G of the image / 

Output: Inclusion tree TO 

1 it <— oo; 

2 R-, 

3 AddRegionsFilles (A,-R) ; 



Procedure AddRegionsFilles (TO; an inclusion tree, R: root of the 
tree) 



1 

2 

3 

4 

5 

6 



if G 7 ^ 0 then 

b ^ first dart of G; 
b' ^ 02(5); 

foreach darts bf,xt of the connected component of b' do 
let Rf he the region of bext', 
it Rf was not already met then 
^ to add an edge R Rf to A', 



[_ to suppress the dart b^xt of G; 
foreach regions Rf direct descendent from R in A do 

^ AddRegionsFilles i 



region Rf met during this scan lead to the creation of an edge R — > Rf (lines 4 
to 6 of the AddRegionsFilles procedure). 



5 Border map of regions of an 3D image 

The computation of the inclusion tree given to Section 4.3 stays in any dimension, 
with no modifications. Therefore we keep it to extend the construction of the 
G-map. 

The first algorithm is extended easily to dimension 3 for an image of ni x 
U2 X ns voxels. The creation of the initial 2-G-map is replaced by the one of a 
3-G-map of ni x U 2 x ns voxels. Faces merging is replaced by volumes merging, 
edges merging by faces merging, and an edges merging is added. 

The first optimization is translated just as merely: it is sufficient to replace 
the two possible faces merging by three possible volumes merging along the 
three faces of the current voxel in contact with the 3-G-map under construction. 
Figure 14 shows an example of merging: one new cube is created (a), 3-sewn 
along three of its faces (b), then the merging with the cube to the left is done 
(c). 
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Fig. 15. 8 precodes for the construction of a 3-G-map 



The extension of the second optimization consists only in replacing the 
4 pixels-based precodes by 8 voxels-based precodes (see Figure 15 and Algo- 
rithm 4), this is due to the fact that each of the three faces of the current voxel 
in contact with the 3-G-map are either deleted or are not volumes merging. 



Algorithm 4: Algorithm with 8 precodes 



Input: Image / of ni x n ,2 x ns voxels 
Sortie: The corresponding 3-G-map 

Create a first cube corresponding to the first voxel; 
foreach voxel do 

b Apply operations related to the current precode to 3-G-map; 

foreach edge of the obtained 3-G-map incident to only two faces f, f' do 
b Merge the faces / and /' (Figure 16); 

foreach Incident vertex to two edges e and e! not on the same line do 
b Merge the edges e and e'; 



The faces merging (illustrated Figure 16), consists to apply twice the pre- 
viously described one for the dimension 2, whereas in 2D the merging edges is 
carried on couples of darts (one for each of the two faces), in 3D the merging is 
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(a) (b) (c) 

Fig. 16. 3D faces merging 



carried on 4-tuple of darts (one couple for every double-face). Therefore the first 
three algorithms are easily extended to 3D. 

The extension to nD of the 3D algorithm above consists in replacing the 8 
precodes by 2” precodes of 4 cubes of dimension n, and to replace the volumes, 
faces and edges merging by the merging of n-,(n — 1)-,..., 1-cells. This extension 
may have a convenient in 4D to process animated sequences of 3D images. 

The extension of the last optimization is more difficult. A first approach 
consists in enumerating all possible precodes. 

Let Nfi be the number of precodes to the dimension d, can be computed 
thanks to numbers S^, called numbers of Stirling of first order. gives the 

gk 

number of surjections of a set of n elements in a set of k elements. gives 
then the number of partitions in k subsets of a set of n elements. In general 
n = 2"^ , it is then sufficient to add partitions of one set, two, ..., n sets to 

2<i s'”. 

get the number of precodes. One has therefore: Nd = X)fe=i ~kT knowing that 
Bn = X)fc=i where Bn are Bell numbers given by the recurrence formula: 






E n — 1 



BiXr 



with Bq = Bi = 1. 



t—0 i\x{n—i)\ 

We can easily compute the number Nd of precodes for the dimension d. In 
2D we have the 15 precodes (the 12 precodes of Figure 7 -I- the 3 cases of no 
manifold). In 3D, we get: 

I precode if only one color is present, 127 precodes for 2 colors, 966 precodes 
for 3 colors, 1701 precodes for 4 colors, 1050 precodes for 5 colors, 266 precodes 
for 6 colors, 28 precodes for 7 colors, 1 precode for 8 colors, thus a set of 4140 
precodes (including cases of no-manifold)! 

In 4D there are more than 10 000 millions of precodes, so that even without 
the non-manifold cases we cannot to consider an optimized solution based on 
precodes as the one for the GTF algorithm in [Fio96]. 

An approach under study for the 3D case consists in decomposing the process 
for eight voxels in the 2x2x2 window into three processes of 4 voxels, analogous 
to the 2D precodes. Thus the number of configuration are limited to 36, with 
the possibility, to do the merging of edges in the same steps. 

The exhaustive process of precodes is optimal in time, but impracticable. 
On the contrary, the solution with 8 precodes is simple, but requires to create 
numerous darts that will be immediately destroyed by merging. The solution of 
the 36 precodes seem to be able to be a good compromise between the difficulty 
of implementation work and performances in time. 
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6 Discussion 

We indicate here what brings on our algorithm as opposed to the drawbacks 
evoked for the computation of a FTG in Section 3. Our algorithm constructs 
a G-map of closed contours in one first scan, and then the inclusion tree in 
the second. All the time during the construction process, the G-map describes 
the part of the image already scanned, and bring on our algorithm to be well- 
adapted to process infinite images. The algorithm of computation of a GTF 
does not possess this characteristic. Indeed the current map is not closed (darts 
corresponding to the closing are in the auxiliary list) but it is the price to pay 
in order to have an optimal algorithm. 

Note that the second scan which build the inclusion tree takes place on the 
G-map, and does not require the image any more. But it requires a time propor- 
tional to the number of darts of the G-map, whereas the number of operations 
done by the one of [Fio96] is weaker since it is proportional to the number of 
contours. The counterpart is that the latter creates only an implicit inclusion 
tree, whereas the marking of darts allows to create an explicit inclusion tree. 

The proposed 2D algorithms provides 2-g-maps with embedded straight darts, 
so it allows immediate display of the map. The counterpart is that the number 
of darts is bigger than in the GTF case. To get a map having a dart number 
analogous to the FTG, it is sufficient to avoid the test of alignment. Besides, this 
characteristic implies in 3D that only the G-map and the current voxel have to 
be memorized, whereas, without embedding, two slices of the image have to be 
kept in memory in addition. Moreover, the 2-G-map with straight darts allows 
to analyse the three cases where the regions are not manifolds. 

7 Application 

The use of G-maps contributes to simplify and to generalize the algorithm. For 
the sake of performance in space, algorithms can be translated in terms of maps. 
Due to the actual memory capacity of workstations, this translation seems to be 
necessary as soon as we have 3D images. 

The 2D algorithm provides performances comparable in time to the [Fio96] 
one. An image processed in 0, 57s by the latter, requires 0, 71s for our algorithm, 
which however provides in addition the inclusion tree and an embedable map. 
From a segmentation of a 512 x 512 x 192 scanner image the computation of 
the border map of a stomach (Figure 17. a) requires 496s and 22576 darts. The 
one of the border map of lungs (Figure 17. b) requires 538s and 135466 darts. 
The different optimizations have all been written and tested in GaML. This 
implementation allowed to measure gains in time for each optimization stage. 
The final version is written in G. 

8 Conclusion 

The use of results inf topology-based geometric modeling has allowed to pro- 
pose for image analysis a modeling of nD image by a border map. Successive 
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(a) Stomach 




(b) Lungs 



Fig. 17. 3-maps of a stomach and a lung 



optimizations of a naive algorithm have shown different aspects of the problem 
of the construction of a border map of an image allowing in the same time to 
simplify, optimize and extend the algorithms to nD. 

[Fra95,Fra96] showed that concepts and methods developed in topology- 
based modeling self-apply to discrete geometry. The topology-based modeling 
often privileges the cellular models (in 2D, faces have any number of sides, and 
in 3D volumes have any number of faces), whereas the discrete geometry and 
image analysis are led by nature to simplicial structures [May67,LL95]. 

The experiments in 3D show that the algorithm is efficient in time, but that 
the size of images is strongly limited by the memory space available. It comes 
from the fact that cellular structures used are too general. They do not take into 
account that all cells are orthotrop. It seems to us that in order to partially clear 
of this limitation hybrid structures (i.e. simplicial for cells of some dimensions 
and cellular for others) should be used. 

More generally, using general cellular and simplicial structures should allow 
not only to solve problems of structuring information in image analysis, and 
especially in 3D more comfortably than with ad hoc structures, but also to unify 
methods of structuration actually different as the polyedrisation proposed by 
[KIE96], border map presented here and the different levels of representations 
proposed by [ADFQ96]. 
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Abstract. In this paper we introduce a deformable model based on 
cellular complexes. This model allows the representation of objects with 
different local dimensions, and has good topological properties. We define 
homotopic deformation on this model and prove that a local criterion can 
be used to characterize simple elements of the model. This criterion is 
used to build an homotopic deformable model that can be used for image 
processing. 



1 Introduction 

In some applications it is necessary to be able to represent volumes and surfaces 
in the same model. For example, if one wants to represent the surface of the 
brain cortex with accuracy, it is necessary to be able to model 2-dimensional 
structures because the cortex folds make this surface complicated. The cortex 
surface can, on some aspects, looks like a crumpled thick sheet of paper where 
thin structures in the folds are 2-dimensional (figure 1). It is not thus reasonably 
possible to find an analytical model to represent it. For the same reasons, the 
use of a parametric model, although possible, would result in a structure which 
is complicated and difficult to use. Various approaches were proposed to de- 
fine a surface in the discrete three-dimensional space , the graph-theoretical 
approaches [13,11,12], the approaches based on voxel boundary [1,7] and the 
combinatorial approaches [2,5]. But, in order to have a model that can represent 
objects of different dimensions and that can also be used as a deformable model, 
it is necessary to introduce a new model. 

We present, in section 2, a model based on cellular complexes that allows 
the representation of 0-, 1-, 2- and 3-dimensional structures in the same model. 
Then, in section 3, we define homotopic deformations for this model. Using the 
good topological properties of the cellular complexes, we adapt the notion of 
simple points of a binary digital picture to our model, and introduce the notion 
of simple elements that are elements that can be deleted from the model without 
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cortex surface 

white matter 
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(a) (b) 

Fig. 1. Representation of thin structures: (a) Zoom on a slice of a segmented and 
classified brain image, (b) Brain volume and the cortex surface. The background 
is locally 2-dimensional in the folds. 




changing its topology. We then prove that a unique local criterion can be used 
to characterize simple elements of any dimension. In the last section, we present 
a way to use this criterion to build an homotopic deformable model that can be 
used for the segmentation of medical images. 

The classical surface definitions, when they are used with the conventional 
cubic grid, are not free from topological paradoxes. Moreover, while being based 
only on images made up of voxels, it is not possible to represent purely two- 
dimensional structures. Indeed, either an element of surface is represented by 
a voxel in which case a surface is a three-dimensional structure, or a surface is 
made up of surfels, but in this case the membership of a surfel to the surface 
depends on the voxels in its neighborhood and it is therefore not possible to have 
purely two-dimensional objects because there is no surfel without voxel. 

Therefore another approach has to be developed in order to model complex 
objects having parts of different local dimensions. Let us take the example of 
the brain cortex. In order to be able to model the cortex as well as possible, it 
is necessary to take into account the parts of the cortex surface that are inside 
the cortical folds. It is thus necessary to introduce a new modeling which makes 
it possible to represent at the same time volumes, to model the grey matter 
and the volumes of the cerebrospinal fluid (i.e. the background), and surfaces, 
to model the thin parts of the cortex surface. Moreover, in order not to have 
to re-sample the MRI, it would be interesting if this modeling remains close to 
the cubic grid. Lastly, this model must allow the study of the geometrical and 
topological properties of the objects it represents, if one wants to be able to use 
it as a deformable model. These constraints led us to develop a model based on 
cellular complexes. 



2 The cellular model 



The traditional digital images, in the sense that they use the cubic grid, are 
subject to several topological paradoxes [15,9] among which one of most famous 
is the connectivity paradox. To avoid this paradox, it is necessary to use two dis- 
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tinct connectivity relations, one for the black components and one for the white 
components. But it does not lead to a consistent topological theory. Moreover, 
it is difficult, and in some case impossible, to transfer the most important topo- 
logical notions such as that of an open subset, open neighborhood, continuity, 
topological mapping and many others. 

On the basis of these observations and seeking a representation offering a 
consistent topological theory, Kovalevsky [10] emphasized the need for a struc- 
ture made up of elements of various dimensions. Such a structure is known in 
topology as a cellular complex [6,14]. Thus, he introduced a representation of an 
image based on the elements (or cells) of different dimensions which compose 
the cubic grid. These elements are the cubes (or voxels), the faces (or surfels), 
the segments and the points. A mapping between the cells and a set of labels 
can be considered as an image and is called a cellular model. In this paper we 
only consider binary cellular models. The connectivity between the elements of a 
cellular model is based on a boundary relationship. Two elements are connected 
(or neighbors) if one bounds the other. For example, a cube is bounded by six 
faces, twelve segments and eight points. It has twenty six neighbors. A face is 
bounded by four lines and four points but it bounds two cubes so it has 10 
neighbors (figure 2). There is only one connectivity relationship that does not 
depend on the labels of the cells but only on the dimension of the cells. There- 
fore, unlike classical digital pictures, the underlying graph is fixed, it does not 
depend on the image labels. Kovalevsky showed that this structure is free from 
the connectivity paradox. 




Fig. 2. The four neighborhoods of a cellular model: (a) the neighborhood of a 
point, (b) the neighborhood of a segment, (c) the neighborhood of a face and 
(d) the neighborhood of a cube. 



This structure allows the representation of objects with different local dimen- 
sions. It is thus possible, for example, to have pure 2-dimensional structures and 
3-dimensional objects in the same model. The topology of the object that can 
be modeled with cellular model can be very complicated. If we want to be able 
to deform an object by deleting or adding elements, it is therefore necessary to 
preserve some topological properties of the object in order to be able to avoid 
creating complicated objects from simple ones. 
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3 Homotopic deformations 

Homotopic deformations are deformations that preserve certain topological prop- 
erties. There is no unique definition of deformations that preserve topology in 
the literature. In the discrete three-dimensional topology framework, this notion 
is usually defined as the preservation of the number of connected components 
and of the number of tunnels [ 16 , 8 ]. This leads to the notion of simple points 
which are the voxels that can be removed from an object without changing its 
topology. Bertrand and Malandain [ 3 ] proposed a local characterization of simple 
points. In order to extend this concept to the cellular model, we will show that 
the same criterion can be used to characterize the simple elements of a binary 
cellular model. 

At first, we will remain as general as possible and will define the concepts 
of path, loop and elementary deformation of a loop for abstract graphs. These 
notions allow the definition of simple elements of an abstract graph. Then, we 
will use these concepts within the framework of the cellular model to obtain a 
local characterization of the simple cells. 

3.1 Definitions 

Definition 1. An abstract graph G is a pair {E, V) where E is a discrete set of 
abstract elements and V is a set of ordered pairs (x,y) where x G E and y G E. 

V is called neighborhood relationship. 

In the following we consider a non oriented graph G = (E,V), i.e. (x,y) G 

V {y, x) G V, such that Vx G E, (x, x) G V. We note O a subset of E and 
O = E — O the complementary of O. 

Definition 2. Two elements x and y of O are neighbors on O (denoted by 
O 

X ^ y) if and only if (x,y) G V. The set of neighbors of an element s of E (not 
necessary in O) belonging to O is denoted by No{s). 

Definition 3. A path j of O is either the empty path 70, or a series 

O 

So, si, s„ (n G IV) of elements of O such that Si ^ Si+iVz G [ 0 ,n — 1 ]. 
So (resp. s„) is the initial (resp. final) element ofj. The set of paths of O is 
denoted by Tq- 

Definition 4. Given an abstract graph G = {E, fE) and a subset O of E, two 
elements x and y of O belong to the same connected component of O (denoted 
by X y) if and only if there exists a path in Eq of which x is the initial 
element and y the final element. The number of connected components of O is 
denoted by Ncc{ 0 ). 

Definitions. The concatenation of two paths 7 = So.Si,...,s„ and 

7' = Sq, s'l, ..., sj„ of Eo, such that Sn ^ Sg, is the path 

7.7' = Sg, si, Sn, Sg, s'l, s'j^. The empty path is the null element for con- 
catenation: V 7, 70.7 = 7.70 = 7. 
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Definition 6. A loop uj of O is a series si, S 2 , Sn n € IN* of elements of O, 
o o 

such that Sn ^ sq and\/i € [l,n— l],Si ^ Sj+i. The set of loops of O is denoted 
by Qo- 

In order to define topological equivalence we need to be able to characterize 
a tunnel. There is a tunnel in an object when one can find a loop that cannot be 
continuously deformed into a point in the object. Therefore we need to introduce 
a notion similar to the continuous deformation of a loop. We will use the notion 
of elementary deformation of a loop which is, intuitively, the smallest possible 
continuous deformation. The iteration of elementary deformations on a loop 
forms a set of deformations that are similar to the continuous deformations. 

Definition 7. Two loops co and to' of O, are equivalent up to an elementary 

deformation (denoted by ui ^ uj' ) if at least one of the following conditions is 
true: q 

• Lo' ^ W ( 1 ) 

where tt G Tq, Si,S 2 G O (2) 

\ UJ = 7r1.S1.S2-S3.7T2 , „ „ 

• s , where 7Ti, 7T2 G t o, si, S 2 , S 3 G (d (dj 

= 7r1.s1.s3.7r2 

The condition (1) imposes symmetry. The condition (2) allows two loops that 
differ only by a circular permutation on their elements to be equivalent. The con- 
dition (3) defines an elementary deformation as a “small deviation” of a loop on 
three elements which are mutually neighbors (figure 3) . Since an abstract graph 
has no geometrical basis, it is important to define in what sense this relation 
can be seen as a continuous deformation. To choose a mapping between a graph 
and geometrical structure, one must consider the neighborhood relationship as a 
proximity relationship. Two elements are neighbors if they are “as close as possi- 
ble” . To compare the elementary deformation to a continuous deformation, it is 
sufficient that any geometrical structure composed of one element, two neighbors 
or a triplet of mutually-neighboring elements has the topology of a filled sphere. 
To deform a loop it is necessary to transform a pair of neighbors into a triplet of 
neighbors or to do the contrary. Then, any path in the object corresponding to a 
pair of neighbors (x, y) can be continuously deformed into any other path in the 
object corresponding to a triplet of neighbors {x, y, z). Therefore, the elementary 
deformation can be seen as a continuous deformation. 

We are now able to continuously deform a loop by using a series of elementary 
deformations. This gives us the following equivalence relationship between loops: 

Definition 8. Two loops to and uj' of TIq are equivalent by deformation if and 
only if at least one of the following conditions is verified: 

• uj' ^ UJ ( 4 ) 

• there is a series ujo,uji, ...,o;„ (n G IV) of loops of O such that (5) 

ff ff. fL ! 

UJ ^ UjQyUjQ ^ UJi, ..., Wn-I ^ UJn, UJn ^ UJ 



( UJ = S1.7r.S2 
( w' = S2.S1.7r 
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S2 




Fig. 3. Elementary deformation of a loop. The loop 7r1.s1.s2.s3.7r2 is equivalent 
to the loop 7r1.s1.S3.7r2. 

The “equivalence by deformation” relationship is an equivalence relationship 
on f?o. The transitivity is an immediate consequence of the definition. The 

symmetry is a consequence of the symmetry of w ^ uj' . The reflexivity is a 
consequence of the reflexivity of the neighborhood relationship. For any lv in 
170) the equivalence class of loops of O which contains oj is denoted by [oj]q. 

In order to characterize tunnels we need to use the deformation of a loop into 
a point. Therefore, we will need the following definition. 

Definition 9. A loop oj of flo is reducible in O if and only if there exists a 
loop uj' G [w]q such that to' = s, s € O. 

3.2 Simple elements 

Kong and Rosenfeld [8] give the following definitions of an image and of simple 
points: 

A conventional digital picture is a quadruple (V,m,n, P), where V = Z'^ or 
V = Z^, P QV, and where {m,n) = (4,8) or (8,4) if V = Z'^ and {m,n) = 
(6, 26), (26, 6), (6, 18) or (18,6) if V = Z^. The elements of V are called the 
points of the digital picture. The points in P are called the black points of the 
picture; the points in V — P are called the white points of the picture. 

Let Vi = (Z^, m, n, P) be a three-dimensional digital picture. Then the 
deletion of the points in a subset D of P preserves topology if and only if the 
resulting picture P2 = {Z^, m, n, P — D) satisfies the following conditions: 

each black component of V\ contains exactly one black component of 
each white component of V 2 contains exactly one white component of Pi (7) 
each closed path in P can be digitally deformed in P to a closed path in (8) 
P-D 

whenever one closed path in P — D can be digitally deformed in P to (9) 
another closed path in P — D, the first closed path can also be digitally 
deformed in P — P> to the second closed path. 

This definition of the simple points ensures the preservation of the number 
of connected components of black points (6) and of white points (7), it ensures 
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also the preservation of the tunnels in the black components by prohibiting the 
disappearance of a closed curve (8) and the appearance of a closed curve in the 
white components (9). On the basis of these properties, we will define the concept 
of simple elements of a cellular model by adding constraints for preservation of 
the loops in the white components, and by expressing these constraints using 
the definitions 1 to 9. 

Let G = {E, fs) be an abstract graph, and O a non empty subset of E. We 
note O = E — O the complementary of O. The set of simple elements of O, 
denoted by So is a subset of O such that Vs G S'o, the following conditions are 
true: 

• B X G O — {s} such that x s (10) 

• 3 X G O such that x s (11) 

• V X, j/ G O — {s}, X y ^ X y (12) 

o o+{«> 

• y x,y G O, X <-7^ y ^ X y (13) 

• V w G l7o, w is irreducible in O (14) 

3uj' G [eo\o ^ ^ 0 -{s} et u)' is irreducible in O — {s} 

• V O' G l7o-{s}) is irreducible in O — {s} (15) 

u is irreducible in O 

• y U! G oj is irreducible in O + {s} (16) 

3 oj' G such that uj' G fi-Q et to' is irreducible in O 

• V w G 17 q, to is irreducible in O =4> (17) 

ui is irreducible in O + {s} 

The conditions (10) to (13) ensure the preservation of the connected com- 
ponents of O — {s} and of O when s is removed from O and added to O; (10) 
prevents the disappearance of an isolated element of O; (11) prevents the appear- 
ance of an isolated element in O, (12) prevents the separation of one connected 
component in O and (13) prevents the fusion of two connected components of 
O. The conditions (14) to (17) prevent the appearance or the disappearance of 
a tunnel in O and in O; (14) prevents the deletion of a tunnel in O, (15) pre- 
vents the appearance of a tunnel in O; (16) prevents the appearance of a tunnel 
in O and (17) prevents the disappearance of a tunnel on O. We could define 
the expressions dual to (12) and (13) in order to prevent the separation of one 
connected component of O, and the fusion of two connected components of O. 
But since we only consider the deletion of an element of O, these conditions 
would always have been true and thus are useless. Moreover, it is possible to 
add elements to O while preserving the topology, we just have to swap O and 
O. This is why O and O + {s} as well as O and O — {s} have symmetrical roles. 
This symmetry will appear in the local characterization of the simple elements. 
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We now introduce a condition that is necessary for an abstract graph element 
to be simple. We will then show that this condition is not sufficient in the general 
case but is sufficient when applied to a cellular model. 

Proposition 1. If s is a simple element of O, then Ncc{No{s) — {s}) = 
NcciN^is)) = 1 

Proof. Let s be a simple element of O. 

If Ncc{^o{s) — {s}) = 0, the condition (10) is not verified. 

If Ncc{Nq{s)) = 0, the condition (11) is not verified. 

If Ncc{No{s) — {s}) > 2 then there exists two distinct connected components 
Cl et C2 on No{s) — {s}. Therefore, there are two possibilities: 

Either Ci and C2 are not connected in O — {s}, then 3 x G Ci and 3 y G C2 
which do not verify (12), because Ci and C2 are connected in O through s. 
Therefore s is not a simple element of O. 

Or, 3 7 G To, 3 Si G Cl, 3 S2 G C2 such that w = S1.S.S2.7 G I2o (figure 4). 
For any number n of deformations applied to w, we will alway obtain, except for 
a circular permutation, a loop such that: 

‘-^n = 7ri.S2.s.S2.7r2 where s)^ G Ci,S2 G C2 and 7ri,7r2 G Pq (18) 

Proof. Let us prove it by recurrence on n. If n = 0, we have immediately ojq 
by putting s'l = si, S2 = S2, tti = 70, 7T2 = 7. If the condition (18) is true 
for n = with N G IN then it is true for n = N + 1 because an elementary 
deformation cannot delete s from the loop, because s^ and S2 are not neighbors. 
Moreover the elementary deformation can only delete s^ (resp. S2) if there exists 
s" in Cl (resp. C2) which would become the new predecessor (resp. successor) of 
s in wat+ 1 (figure 5). Therefore any loop of [O]^ contains s and at least two other 
elements, and therefore is irreducible and cannot belong to I2o-{s}- Therefore u 
is irreducible in O and this contradicts ( 14) . Therefore, s is not a simple element 
of O. 




Fig. 4. In the neighborhood of s, there are two distinct connected components. 



If Ncci^o^s)) > 2 we can prove in the same way as before that (16) is not 
respected and therefore s is not a simple element of O. 
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Fig. 5. Two possibilities of elementary deformation of a loop. 



The reciprocal of the proposition 1 is not true. If Ncc{No{s) — {s}) = 
^cci^oi^)) = the four connectivity conditions (10) to (13) are true. More- 
over, let us show that(14) is true. If (14) were false, we would have: 

3 uj G Qq such that w is irreducible in O and such that (19) 

'i ij G f2o-{s} such that lo' G [oj\o then uj' is irreducible in O — {s}. 

If to' exists, since it is equivalent to oj and irreducible in O, then uo is irre- 
ducible in O. It contradicts (19). Therefore uj' does not exist, that means that all 
loops in [lo\q contain s and either s is isolated {Ncc{No{s) ~ {s}) = 0), or there 
are at least two distinct connected components in No{s) — {s}. It contradicts 
the assumption that Ncc{^o{s) — {s}) = Ncc{Nq{s)) = 1. Therefore (14) is 
true. 

It is possible to prove in the same way that (16) is true too. But it is possible 
to find a set E, a neighborhood relationship V and a subset O G E such that 
.^Cc(.^o(s) — {s}) = NcciN-Q^s)) = 1, and (15) is false or (17) is false (figure 6). 
This is because it is not possible to detect the appearance or disappearance of 
a tunnel with only the neighborhood of an element. To make it possible, it is 
necessary to have more assumptions about E and V. 




• Element of O 
° Element of O 



Fig. 6. (a) s is a simple element of O. (b) s is not a simple element of O be- 
cause the loop S1.S2.S3.S4 is reducible on O but not on O — {s}. In both cases, 
Ncc{No{s) - {s}) = NcciN^is)) = 1. 
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3.3 Simple cells in a cellular model 

The neighborhood of an element of a cellular model depends on the type of 
element. There are four types of elements: cubes, faces, lines and points. But if 
we see the neighborhoods as a graph where the vertices are the elements of the 
complex and the arcs are the connectivity relationships, we can notice that there 
are only two different neighborhood graphs. Figure 7 shows that the different 
neighborhood types form only two distinct polyhedron shapes when we represent 
the connectivity graph. The polyhedron are made up of triangles which provide 
the following property: 

V s G if, V Si, S2, S3 G Ne{s), Si S2 and S2 ^ S3 ^ si sg, ( 20 ) 




Neighborhood of Neighborhood of Neighborhood of Neighborhood of 

a point a segment a face a cube 



Fig. 7. The different neighborhoods of a cellular complex. Three-dimensional 
representation of the elements connected to a central element. 



Proposition 2. If the neighborhood in E of an element s of O can be rep- 
resented as a polyhedron made up with triangles, then Ncci^ois) — {s}) = 
Ncc(Nq(s)) = 1 <J 4 > s G S'©. 

Proof. We suppose Ncc{No{s) — {s}) = Ncc{Nq{s)) = 1. We have seen that 
we only have to show that the conditions (15) and (17) are true. We will prove 
it for the condition (15), the demonstration for (17) is similar. We will need the 
following lemma: 

Lemma 1. If s € E is an element whose neighborhood in E can be represented 
by a polyhedron made up with triangles and such that Ncc{No{s) — {s}) = 
Ncc(Nq(s)) = 1, we have: 

V w G ^No-{s}{s), ^ is reducible in No-{s}{s) 

Proof. In order to avoid a tiresome proof we will use a geometrical approach 
that uses recurrence on the number n of elements of Nq^Q. 

If n = 1, there exists only one element s' of O in Ne{s). Since Ne{s) can be 
represented as a polyhedron and since s' is a vertex of the polyhedron, the other 
vertices are elements of O and can be represented as a planar structure made up 
with triangles. In this structure only the external vertices are connected to an 
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element of O (figure 8). This structure represents the elements of -/Vo_{g}(s) and 
is made up with triangles that represents triplets of elements, any pair of which 
being neighbors. Since an elementary deformation of a loop is based on such a 
triplet of elements, we can deduce that any loop of No-{s}(s) is irreducible. 




Fig. 8. Planar representation of a neighborhood. 



Let us suppose that the lemma is true for n = N where TV > 2. To increment 
n by one, we must remove an element from No-{s}(s) to Nq(s). On the planar 
representation it is done by removing a point on the boundary of the structure 
because only the external points are connected to Nq{s). Thus, we do not change 
the properties of the planar structure and therefore any loop of No-{s}(s) is 
reducible. 

To prove the proposition 2, we will suppose that (15) is false: 

3 u; G l7o-{s} such that oj is irreducible in O — {s} and reducible in O. (21) 
Then: 

to is reducible in O 3 w' G n 17 atq(s)_{s} (22) 

According to the lemma, tv' is reducible in TVo(s) — {s}, therefore uj is reducible 
in O — {s}, which contradicts (21). 

4 Cellular model implementation 

The best way to implement a cellular model is to use the structure of a classical 
image and replace each voxel with the cells presented in figure 9. Thus, we can 
use a four-dimensional array indexed by (x, y, z, n) containing cells. The n index 
is called the cell number and represents the cell type. It is possible to build 
the neighborhood system for each element using offsets on the array indices. 
Each cell of the figure 9 must be provided with its own set of offsets to find 
its neighbor, thus this set depends on the cell number n. Therefore, the cellular 
model implementation is composed of the following items : 

— An array C indexed by quadruplets {x, y, z, n) and whose elements are cells. 
Indices x, y and z vary between zero and the cellular model size, n is a cell 
number and varies between 0 and 7. 
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— Height neighborhood arrays Vq, V2, V-j containing quadruplets (dx, dy, dz^ 
dn). For any element {x,y,z,n) of C, each element of Vn means that the 
element (x + dx, y + dy, z + dz, n + dn) of C is a neighbor of (x, y, z, n). 




Fig. 9. Implementation of a cellular model. 



It is clear that a cellular model is bigger than the corresponding image. 
Indeed, the implementation of a cellular model is about height times the size 
of a three-dimensional image. However, the structure we are using allows us to 
implement a cellular model in an efficient way since it is possible to represent 
a multi-dimensional array as a one-dimensional array and therefore to use an 
integer number k to identify a cell. The cell number is then deduced from k with 
a modulo operation. The neighborhoods are then four one-dimensional arrays of 
integers. As a result, algorithms using cellular models can be fast enough to deal 
with the large number of cells. 

We can use the same method to pre-calculate in an array the connections 
between the neighbors of a cell which are necessary to decide if it is simple or 
not. It is therefore very fast to detect simple cells. Moreover, in the process of 
deforming a cellular model, we only consider the simple cells. If the shape of the 
objects modeled by the cellular model is not extremely complicated, the number 
of simple cells, which are on the border of the object, is low as compared to the 
number of cells. It is therefore possible to use this homotopic deformable model 
in a practical way. 

5 Discussion and Conclusion 

We showed that the only thing one has to know to decide if an element of a 
cellular complex is simple or not, is the neighborhood of the element. The local 
criterion used to characterize simple elements is easy to implement and fast to 
compute. Therefore, it is possible to use this criterion iteratively to deform an 
object without changing its topology. However, the order in which the simple 
elements are added or deleted is important because changing the label of an 
element can change the simpleness of its neighbors. Thus, it is difficult to par- 
allelize such an algorithm and the result of a sequential algorithm can depend 
on the order used to scan the elements of the cellular complex. A good way to 
limit this order effect and to speed up the algorithm with parallelization, is to 
consider in a single pass only elements of the same dimension. Two elements of 
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the same dimension are not neighbors, so the modification of one element does 
not influence the simpleness of the others. The result of such a single pass can 
therefore be independent of the scanning order if only local criteria are used to 
change the elements’ labels. 

In order to use this model as a deformable model, it is necessary to guide 
the model towards a desired solution. One possible way to do this is to use 
a cost function to choose the elements to modify. We successfully used this 
method to segment the human cortex from MR images of the head [4]. The 
global cost function we used was a combination of several cost functions that 
can be classified in two categories: external cost functions and internal cost 
functions. External cost functions use information that is not part of the model, 
for example localization information obtained after a classification of an image. 
Internal cost functions are obtained from the model and can be used to impose 
some geometrical a priori on the solution. 

The modeling of an object with a cellular complex based model allows the 
representation of objects with different local dimensions, although such objects 
cannot be represented with conventional digital pictures. The good topological 
properties of cellular complexes allowed us to show that a cellular complex based 
object can be deformed while preserving its initial topology. The deformation 
is done by iteratively removing or adding simple elements from the object. We 
showed that a local criterion can be used to characterize the simple elements 
of an object. This simple criterion can be used to implement an homotopic 
multidimensional deformable model that can be used to solve image processing 
problems such as segmentation of objects with a complicated surface. 
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Abstract. In this paper we focus on distances between textures, and 
develop metrics on image spaces in contexts of image transformations. 
Given a metric on the range space, we can generate the initial topology 
for the domain space. For this topology we can obtain a correspond- 
ing metric using well-known metrization constructions, also providing 
granularity of the metrics. Examples are drawn from the Spatial Gray 
Level Dependency (SGLD) transformation and the application domain 
is texture recognition in medical imaging. 



1 Introduction 

In recent years there have been several attempts to define metrics for measur- 
ing distances between images or components in images. Inspired by [I], with 
questions related to finding appropriate topologies for morphological studies, 
it seems similarly natural to seek for appropriate metrics for particular image 
analysis and computer vision problems. These approaches are often focused on 
overcoming problems related to distortion given a domain space of usually grey- 
scaled images. A typical solution is given in [2], extending techniques originally 
proposed by [1], which builds upon Hausdorff metrics on classes of compact sets. 
Robustness of metrics is studied with respect to distortions. Another approach 
is provided in [3] , where neighborhoods and weightings are used to define classes 
of semi-pseudo metric spaces. The triangular inequality is sacrificed, and results 
relate to how neighborhood based metrics shift to weighting based metrics in 
the limit. 

In this paper we focus on distances between textures, and suggest to derive 
a metric using observations on transformations of images. Thus we develop met- 
rics on image spaces in contexts of image transformations, and therefore consider 
both the original domain space as well as the range space of the image trans- 
formation. For the generated metric on our domain space we obviously require 
that nearness of a pair of images in the range space reflects nearness of the 
original images in the domain space. This is to say that we require our image 
transformation to be a continuous mapping. Further we desire to have a certain 
granulation in our metric, i.e. we want the metric to be parameterized in some 
appropriate way. The construction stems from our starting point that Fuclidean 
distances or Hausdorff metrics are inappropriate in a domain space but natural 
to consider for the range space. Thus given a suitable metric on the range space. 
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we have the corresponding topology and can generate the initial topology for the 
domain space. If this initial topology is reasonably well-equipped we can obtain 
a corresponding metric using well-known topological metrization constructions. 
These constructions also provide the granulation parameters. The construction 
also relate to our cognitive notion of distances between textures. 

Our exemplifications are drawn from Spatial Gray Level Dependency (SGLD) 
transformation [4] and the application domain inspiring these developments is 
texture recognition in mammographic images [5,6]. SGLD transformations have 
been successfully used also for other imaging problems [7]. 



2 Notation and Basic Constructions 



Let X denote a domain space of images, and let L be a set of ’pixel values’. 
Thus an image is a function / : — > L. The space, or set, of such images, or 

functions, is denoted F{X, L). An image transformation t is a mapping between 
image spaces, i.e. t : F{X,L) F{Y,M). In a 2-dimensional grey-scale image 
we would have X = A x B and L would be a linearly ordered complete lattice. 

For our formal constructions we obviously need to consider underlying mea- 
sure spaces and corresponding integrability of images as functions. However, our 
constructions are based on a rather informal mathematical treatment, and there- 
fore these considerations are left out. When restricted to the real-valued case, 
the constructions can be seen to build upon Lebesque measures and integrability. 
For further background to topologies and (pseudo-)metrics, see e.g. [8]. 

The pointwise defined metric dp on F{X, L) is given by 

rfp(/i,/2) = (^i/i-/2r) (1) 

The typical example demonstrating the drawback of this metric is the following. 
Let X = A X B, where A = {oi, . . . , 0^} and B = {bi, . . . , 6„}, i.e. A be a grid 
of pixels, and further fi'.X^ [0, 1] be defined by 



fi{az,bj) 



l,'\i i + j is even, 
0, otherwise. 



and f 2 ■ X ^ [0, 1] be defined by 



(2) 



h{ai,bj) 



1, if z -I- j is odd, 
0, otherwise. 



(3) 



i.e. fi and /2 represents m x n ’chessboards’ with, respectively, white and 
black in the upper left corner, see Figure 1. 

Then (/^ I /i — /2 = (/^ which clearly is undesirable. In order 

to introduce metrics generated using transformations, consider first the mapping 
f : X ^ Y, and a (pseudo-) metric {Y, e). Then the mapping d : X xX defined by 
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Fig. 1. Chessboard texture 



d{xi,X 2 ) = e{f{xi),f{x 2 )) is a (pseudo-)metric for X, and / : (X,T) (Y,ld) 
is continuous, where T and U are the corresponding metric topologies generated 
by d and e, respectively. 

Note that these constructions do not involve any granulations. Usually, some 
granulation features are desirable from application point of view. Granulation 
can be achieved either using neighbourhoods of points in X, or, as mentioned 
above, using metrization techniques related to the initial topology on the domain 
space given by the metric topology on the target space. Details are presented in 
the following section. 

3 Metric Image Spaces 

In this section we will construct metrics on F{X, L), given image transformations 

t:F{X,L)^F{Y,M) (4) 

and metrics on F{Y, M). Our constraint to have nearness of images fi G F{X, L) 
being related to nearness of corresponding transformed images r(/i) G F{Y, M) 
will be guaranteed by the requirement that r is continuous. 

Throughout this section we will assume F{Y,M) to be equipped with some 
metric 

ep{gi,g 2 ) = \g 1 -g 2 • (5) 

3.1 Granulation Using Neighborhoods 

Assume that Y has an underlying metric. For g G F{Y, M), let : F{Y, M) — > 
F{Y,M) provide a modification of g so that K^"'\g){y) combines values of g{y'), 
y' G B{y,^), in some appropriate way. 

A particular choice for k might be 



K^'*Hg){y) 



(6) 
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i.e. the mean value of the neighborhood. This a low-pass filtering of g, which 
means that high frequencies will be suppressed in the metric. This in turn means 
that we obtain a metric that is relatively robust to noise. 

Another choice of n is 



K^'’Hg){y)= sup g{y'). ( 7 ) 



Taking the supremum of the neighborhood might in some applications be moti- 
vated. This is mainly true if our t has a very good locality, that is, it concentrates 
t(/) to some points in Y. In these cases it is not desirable to smear these points 
by using a mean value. 

We can now define a metric on F{X, L) through 

I K^^\r{h)) - K^^\r{h)) fY" . ( 8 ) 



We will use c*^°^(/i,/2) to mean (fy |r(/i) — t(/ 2)|^)^^^. A normalization of c 
can be given as 






(fyiy/P ■ 



( 9 ) 



Constructions in [3] correspond to using 7) I T(fi)(m)-T(f2)(y2) |, 

as compared to our | k*-^H'^(/i))(2/) ~ y'^H'’'(f2))(y) I- We have that 7 > 0 and 



y°Hfi,f2) = doo(r(/i),r(/ 2 )) (10) 

since limp^oo dp(/i, /2) = sup(/i, /2) Note that this holds regardless of the 
used. 

Similarly, 7 < d{Y) = sup{d(yi, 7/2) : yi,2/2 € T} and 



^ |k(‘^W))(.t(/i)) - K(‘^W)(.r(/2))|,as p^oo. (11) 

If 7 = d{Y), y'*'>{T{f)) will be constant over the whole Y. 

3.2 Metrization Based Granulation 

The granulation technique is based on a construction, evolving from Urysohn’s 
Lemma [ 9 ], that is used to prove the topological metrization theorem: A topology 
is metrizable if and only if it is Ti, regular, and has a cr-locally finite base. 

Recall that any (pseudo-)metric space (X,d) generates a topology (A, T), 
where the open r-spheres B{x,r) = {y € X \ d{x,y) < r}, a; G A, r > 0 , form a 
base for T. 
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For the purpose of our constructions, we will assume the existence of a rea- 
sonably large set T C F{Y,M) so that each family 

Br = ^T-\B{g^-))\g&TQF{Y,M)^ (12) 

is locally finite and B = is a cr-locally finite base for a topology on 

F{X,L). 

The metrics of F{X, L) can now be defined as constructions in [8] related 
to proving the metrization theorem: 



dr,sifij2) = I ‘Puifi) - ^uif2) I, (13) 

ueBr 



where 



and 



‘Puif) 



l,if / G [/' C [7 

0 ,if /^ [/ 



f/'= U V 

veBs,v^u 



(14) 



(15) 



It can be shown that (F{X,L),dr,s) is a (pseudo-)metric space for all r,s G N. 
We have r < s, and it is intuitively clear that the ’gap’ between U' and the 
complement of U closes up as r approaches s. We will use 7 = 1 to denote 
the intuitive counterpart of dr^s, where the ’gap’ between U' and the complement 
of U is assumed to vanish or be negligible. Thus we arrive at our proposal for a 
metric on F{X,L): 



d^''HfiJ 2 ) = card(B(r(/i),7) U B(r(/2),7) ~ B(t(/i),7) H B(t(/ 2),7)) (16) 
where card(-) is the cardinality for a set. 

3.3 Implementation 

The implementation of the c*- '’'^-metric is rather straightforward and will not be 
treated in this section. 

The implementation of the d('>')-metric is on the other hand more elaborated, 
this is because the idea of the metric is to count the number of functions / G 
F{X, L) that, by the transformation r, is taken into either a surrounding of t(/i) 
or t(/ 2). The larger this set is the more /i and /2 differ from each other. Of 
course, it is a non trivial task to actually implement this particular metric since 
we have no notion of how to generate the surrounding of a point g G F{Y, M) in 
the general practical case. This is due to the fact that each point g is actually a 
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function g : Y —> M that in turn is implemented as a matrix. We can, however, 
calculate the distance between two points gi and 32 with the Cp metric and from 
this we are able to approximate the metric according to: 

d^^HhJ 2 )= . ( 17 ) 

The distance between r(/i) and r(/2) is related to the desired granularity 
7. To raise the result to the power of p approximates the dimensionality of the 
function space F{Y, M). 

Another approximation of is 



d^'^n/1,/2) 



l,if ep(r(/i),r(/2)) > 2y 
1 — f (f)) sin (j )) , otherwise 



( 18 ) 



where (j> = 2arccos ^ This is a fair approximation, provided we 

consider 7?(t(/i),7) and B(r(/2),7) as 2 -dimensional circles with radius 7. The 
distance is the non-overlapping area of the two circles normalized to the [0, 1] 
interval. If i?(r(/i),7) and B(r(/2),7) ^re identical, there is no non-overlapping 
area and the distance is zero. If the distance between r(/i) and r(/2) is more 
than 27 there will be no overlapping area and reaches its maximum value. 

Of the suggested approximations of the first is to prefer due to its gen- 
erality and computational properties. 



4 The T— Transformation 

In our formal constructions, we see how each metric respectively are dependent 
on the transformations r : F{X,L) F{Y,M). In the following subsections we 
will exemplify with specific choices for r. 

There are a number of suitable r:s that might be used. In this article we 
will concentrate on the SGLD as explained below. It is however stressed that 
basically any transform might be used. What transform to use depends on the 
application. 

Spatial grey-level decencies (SGLD) were developed in [ 4 ] for handling statis- 
tical features of textures. We will use the corresponding image transformation 
as one of our case studies for r. However, we will use r in a more general setting 
as compared to the original construction defined in [ 4 ]. 

Let X be the domain for the image, and let L be the set of grey levels. As 
mentioned earlier an image is given as a function / : X — > L. 

A pattern, corresponding to a texture, is given by tt : X — > X, or generally 
by 7T C X X X, or even tt C X". 

A notational convenience is given by F{L‘^, { 0 , 1 }), where 
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c(f1,f2, gamma) 

g1=t(f1) 1| ^ ^ 1 

0.5 

0 

J0.5 

J1 I ■ ^ 

0 2 4 6 

gamma 

d(f1 ,f2, gamma) 

g2=t(f2) 1 1 . . 1 

0.5 

0 

J0.5 

J1 I ■ ^ 

0 2 4 6 

gamma 





Fig. 2. The distances between the two chess boards textures. The distance is zero regard- 
less the granularity used (right column). The SGLD-transform is shown in the center 
column. 



i{ai,a2){h, h) 



1, if Oi — ^1, G2 — 
0, otherwise. 



(19) 



Generalizing l to involve not only crisp values, we have L" F{L^, [0, 1]) 
where t(a) : L” — > [0, 1] should satisfy = 1. This reveals the potentials of 

using t in this way. 

Given tt and t, we can now define the transformation t = t(tt, l) : F{X, L) — > 
F{L^^R) through 

T{f){ai,...,an)= ^ t(ai,...,a„)(/(xi),...,/(a;„)). (20) 

Previously it was shown that two chess board patterns were given the maxi- 
mum distance using the dp-metric. By applying and this is avoided, see 
Figure 2. In this example is Tr(xij) = Xi+ij used. 

By introducing noise we disturb the statistical relationship between points 
in the texture. This implies that the textures that produced points in some 
SGLD transformation will produce smeared points when corrupted with noise, 
see Figure 3. The intensity around one of these points has the same distribution 
as the noise. Hence it can be seen that the noise in Figure 3 is of Gaussian type. 

An example of a more general r is to use the magnitude of the Fourier coeffi- 
cients, that is |/|. This representation is, however, not invariant under rotation, 
but by transforming |/| to a polar co-ordinate system the main direction of the 
texture is easily obtained. This is done since a main direction of the texture in 
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g1=t(f1) 




g2=t(f2) 




c(f1,f2, gamma) 





Fig. 3. The two chess boards textures corrupted with noise (left column). Note that 
the measured distance decreases with increased granularity (right column). The SGLD- 
transform is shown in the center column. 



an image in the F(X, L)-domain generates a linear structure in the F{Y,M)~ 
domain which, when in polar form clearly enhances the angle of the texture (or 
actually the angle orthogonal to the texture). Hence the energy as a function of 
texture angle is obtained as 

= J fp{p,(l))dp (21) 

where fp(p, 4>) is the polar representation of |/|. In the practical case this means 
that the two-dimensional image is reduced to an one-dimensional function. To 
obtain full rotational invariance it is possible to view a{4>) as a periodic function 
with period 27 t and to define 



a(0) := a{<p- p,), 



where p satisfies 



Qf(/i) = sup a{4>). 



( 22 ) 



(23) 



The definition in equation 22 ensures that o;(0) is the supremum of the energy 
function and hence it is meaningful to compare a’s by an ordinary dp metric. 



5 Results 

In this section properties of and will be further examined by some 
examples. The metrics are tested on four images shown in Figure 4. All images 
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Fig. 4. Images A and B on top left respectively right and images C and D on bottom 
left and right respectively. 



have the same mean values and variance of their pixel values. Image D in Figure 
4 is a combination of all the other images, A to C. In tables 5, 6 and 7 the 
distances between noisy images are shown. The Gaussian noise is added so that 
the mean values of the images are preserved and the level of noise is 10% of the 
maximal pixel value. All images are 78 x 156 pixels. 

The numerical value of the distance is somewhat irrelevant, since it is highly 
dependent on the normalization used. However, the ’close-far’ relations, with cor- 
responding ranking indicated in the tables, between the images does not change 
by normalization. 

From table 3, it is clear that the dp metric taken on the original images does 
not separate the images at all. Hence, this metric will not be considered any 
more in this discussion. 

The transformation used is a SGLD with Tr{xij) = for all images. This 

transformation is most sensitive for textures in the vertical plane and hence is it 
most suitable for image A in Figure 4. Given tt, we expect the images A and C 
to be the most different. This is also the case for our metrics. Furthermore we 
expect A and B to be more similar than B and C, since the main texture in B 
is vertical while it is horizontal in C. However d{A, B) < d{B, C) is not fulfilled 
by the di metric. 

It would be desirable that a metric preserves ’close-far’ distances between 
images, even when they are corrupted with noise. From tables 5, 6 and 7 it can 
be seen that d^^'l preserves this order in a best way. This is expected since the 
d%%metric considers the whole range space when calculating the distances. This 
differs from the dp- and c^'^'^-metrics which consider point by point in the Y space 
instead of the F{Y,M) space. 

The conclusion of these observations is that the proposed and d%^-metric 
with a suitableSGLD-transform correspond to some extent to our cognitive sense 
of distance between images. 
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Table 1. -distances. Within 
parenthesis: the ’close-far’-rank. 





BCD 


A 


(2): 0.13 (6): 0.24 (3): 0.16 


B 


(5): 0.18 (1): 0.05 


C 


(4): 0.17 



Table 2. d^^'^ -distances. 




B C 


D 


A 


(3): 0.24 (5): 0.30 (2) 


0.23 


B 


- (4): 0.25 (1) 


0.015 


C 


- (4) 


0.25 



Table 3. f2)-distances. Table 4. di{T{fi),T{f2))-distances. 



di 


BCD 


di 


BCD 


A 


(2): 0.30 (2): 0.30 (1): 0.20 


A 


(4): 0.58 (6): 0.75 (5): 0.62 


B 


(2): 0.30 (1): 0.20 


B 


(3): 0.37 (1): 0.13 


C 


(1): 0.20 


C 


(2): 0.29 



Table 5. f 2 )-distances be- 

tween noisy images. 





BCD 


A 


(1): 0.25 (4): 0.35 (2): 0.26 


B 


(5): 0.38 (3): 0.29 


C 


(3): 0.29 



Table 6. d^^\fi, f 2 ) -distances 

between noisy images. 





BCD 


A 


(2): 0.13 (6): 0.36 (3): 0.16 


B 


(5): 0.34 (1): 0.049 


C 


(4): 0.20 



Table 7. ,T{f2))-distances between noisy images. 



A 

A 

B 

C 



BCD 
(4): 0.23 (6): 0.59 (2): 0.20 
(3): 0.21 (1): 0.095 
(5): 0.29 
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6 Conclusions and Further Work 

Metric image spaces are introduced with respect to corresponding image trans- 
formations. It is shown how a granulation can be introduced using neighbour- 
hoods in domain and range image spaces, respectively. The metrics have comple- 
mentary properties, and together they introduce a metrization typically useful 
in presence of transformations related to texture analysis. 

Further work consists of finding better approximation of the d^^^-metric. We 
also need to add to our understanding what kind of overall texture information 
is represented by a given g G F(V,M). In addition, the neighborhood of g also 
needs an improved analytical description. 
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Abstract. On a collection of subsets of a space, fundamentally differ- 
ent metrics may be defined. In pattern matching, it is often required 
that a metric is invariant for a given transformation group. In addi- 
tion, a pattern metric should be robust for defects in patterns caused by 
discretisation and unreliable feature detection. Furthermore, a pattern 
metric should have sufficient discriminative power. We formalise these 
properties by presenting five axioms. Finding invariant metrics without 
requiring such axioms is a trivial problem. Using our axioms, we analyse 
various pattern metrics, including the Hausdorff distance and the sym- 
metric difference. Finally, we present the reflection metric. This metric is 
defined on finite unions of n — 1-dimensional hyper-surfaces in IR”. The 
reflection metric is affine invariant and satisfies our axioms. 



1 Introduction 

For any collection of patterns, subsets of a space, there exist many possible 
metrics. In pattern matching applications, it is often desired that such a metric is 
invariant for a given transformation group. The shape of a pattern can be defined 
as the pattern modulo the action of this invariance group. By minimisation of a 
pattern metric under the invariance group, one obtains a metric on shapes, see 
Rucklidge [13]. If to be used in pattern matching, a pattern metric should be 
robust for the effects that occur with discretisation, see Fig. 1(a). In addition, a 
metric should have sufficient discriminative power. 

Many pattern matching algorithms use a metric on simple closed curves. 
An important example is the Frechet distance, see Alt and Godau [3]. Other 
pattern metrics for boundary curves are based on turning angle, see Cohen and 
Guibas [5], or normalised affine arc-length, see Huttenlocher and Kedem [10]. 

The Hausdorff metric is defined on the collection of all (non-empty) closed, 
bounded subsets of a metric space. Some algorithms are based on this metric, see 
[4], [11], [1]. However, the Hausdorff metric is not robust with respect to certain 
types of noise. For example, outliers, i.e. isolated points lying far away from 
the other points, can cause a dramatic increase in the Hausdorff distance. The 
Hausdorff metric is invariant for the group of isometries. The partial Hausdorff 
distance is a non-metric variant of the Hausdorff metric that is more robust 
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for noise, see [12], [7]. The partial Hausdorff distance depends on a parameter 
estimating the amount of distortion. 

For solid patterns, robust affine invariant metrics exist. Examples include 
the normalised volume of symmetric difference, and the difference of normalised 
indicators. The volume of symmetric difference was previously used by Alt et 
al. [2]. We introduced the difference of normalised indicators in [8]. 

Until now, little attention has been paid to affine invariant metrics on finite 
unions of curves in IR^, or surfaces in IR^. In this paper, we present the reflection 
metric. This metric is defined on finite unions of (n — l)-dimensional hyper- 
surfaces in ]R". The reffection metric is invariant under affine transformations. 
We show that the reffection metric is robust in many respects. 

Section 2 presents axioms that express properties of pattern metrics, such as 
robustness. The general formulation allows the axioms to be verified for collec- 
tions of patterns in any underlying topological space. Finding a metric satisfying 
a number of axioms becomes more difficult, or may even become impossible, as 
more general classes of patterns and larger invariance groups are desired. 

Section 3 analyses various pattern metrics with respect to the domain of 
definition, the invariance group, and satisfaction of the axioms. We deal with 
the discrete metric, the Hausdorff metric, the (normalised) volume of symmetric 
difference, and the difference of normalised indicators. Lastly, we introduce the 
reflection metric. 

2 Axioms 

Definition 1. If X is a topological space, V is a collection of subsets of X, and 
d is a metric on V, then we call the structure {X,V,d) a metric pattern space. 

The elements of V will be called patterns. The value d{A, B) is the distance 
between the patterns A and B. 

A collection of patterns V uniquely determines a maximal subgroup T of the 
homeomorphisms Hom(A) under which V is closed. The subgroup T consists of 
all t € Hom(A) such that both the image t{A) and the inverse image t~^{A) are 
members of V for all A G "P. 

The metric pattern space {X,V,d) is invariant for a transformation g G 
T if d{g{A),g{B)) equals d{A,B) for all A,B G V. The invariance group G 
of a metric pattern space consists of all transformations in T for which it is 
invariant. Figure 1(b) shows patterns A and B in the Euclidean plane, and 
image patterns g{A) and g{B) under an affine transformation g. Invariance for 
affine transformations makes the distance between two patterns independent of 
the choice of coordinate system. 

Below, we present axioms for metric pattern spaces. The first four axioms 
express various types of robustness. These robustness axioms are preserved if 
the collection of patterns is restricted, or the patterns are replaced with their 
complements. The fifth and last axiom is a property that can be verified for 
metric spaces in general, enforcing a mild form of discrimination. 
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Fig. 1. (a) Discretisation effects (b) Invariance. 



First, we discuss an axiom which states that transformations t “close” to 
the identity, result in a “small” distance d{A,t{A)). In a topological space, this 
kind of closeness can be described by considering small open neighbourhoods of 
a point. For this purpose, we give the group of transformations T the compact- 
open topology, which is defined in terms of the topology on X . This way, T can 
be considered a topological space in which transformations are points, and each 
transformation has open neighbourhoods that are sets of transformations. 

The compact-open topology is characterised by its collection of open sets. A 
collection of subsets 5 of a space S whose union equals S, is called a subbasis 
for a topology on a space S. The topology generated by the subbasis S consists 
of all unions of finite intersections of elements of S. The (relative) compact-open 
topology on the set of transformations T is generated by the subbasis consisting 
of all sets of the form 



S{K,U) = {teT\t{K)CU} , (1) 

where K C X is compact, and U C X is open. 

The metric pattern space (X,V,d) is called deformation robust if it satisfies 
the following axiom: 

Axiom 1. For each A G V and e > 0, an open neighbourhood I C T of the 
identity transformation exists such that d{A,t{A)) < e for all t G I. 

Deformation robustness is equivalent to saying that the map t i-^- t{A) with 
domain T and range V is continuous, for each pattern A G V. Figure 2(a) shows 
the image of A under a transformation t contained in an open neighbourhood 
of the identity. This neighbourhood is a finite intersection of subbasis elements 
S{Ki,Ui) generated by compact segments Ki of A and open balls Ui containing 
A,. 

Define the boundary of a subset A of A by Bd(A) = C1(A) — Int(A), where 
C1(A) and Int(A) denote the closure and the interior of A in X, respectively. We 
call a metric pattern space blur robust if the following axiom holds: 
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Axiom 2. For each A G V and e > 0, an open neighbourhood U of Bd(A) 
exists, such that d{A, B) < e for all B G V satisfying B — U = A — U and 
Bd(A) C Bd(B). 

The axiom says that changes “close” to the boundary of A do not cause discon- 
tinuities. This way, the axiom ensures that “shadow” or “double” edges do not 
significantly affect the distance between patterns. Figure 2(b) shows a neigh- 
bourhood U of Bd(A) in which parts of Bd(i3) occur that are not in Bd(A). 




(b) 

Fig. 2. (a) Deformation robustness (b) Blur robustness. 



A crack of A is a closed subset R oi X consisting entirely of limit points 
of Bd(A) — R. This means that all open neighbourhoods of a point x G R 
intersect Bd(A) — R. Cracks are parts of the boundary, that can be “restored” 
after having been removed from the boundary, by taking the closure of the 
remaining boundary. Changing a pattern in neighbourhoods of a crack may 
cause the pattern (or its complement) to become separated or connected. We 
say {X,V, d) is crack robust if the next axiom holds: 

Axiom 3. For each A G V , each crack R of A, and c > 0, an open neigh- 
bourhood U of R exists such that A — U = B — U implies d{A, B) < e for all 
BgV. 

This axiom says that applying changes to A within a small enough neighbour- 
hood of a crack of A results in a pattern B close to A in pattern space. Whether 
the connectedness is preserved does not matter. 

Figure 3(a) shows a pretzel, consisting of two topological 1-spheres glued 
together at a point x. The set i? = { a; } is a crack of A. This situation is similar 
to a saddle-point event encountered while varying the threshold during image 
segmentation. 

If the following axiom is satisfied, we call a metric pattern space noise robust: 



Axiom 4. For each A G V , x G X , and e > 0, an open neighbourhood U of x 
exists such that B — U = A — U implies d{A, B) < e for all B G V. 
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This axiom says that changes do not cause discontinuities provided the changes 
happen within small regions. ^From the triangle inequality it follows that an 
equivalent axiom is obtained when neighbourhoods of finite point sets instead of 
singletons are considered. In a practical sense, satisfaction of the axiom means 
that the metric is well-behaved with respect to “outliers” and “occlusion” . An- 
other interpretation is that the metric is robust for “emerging” or “disappearing” 
regions. 

Figure 3(b) shows a pawn-like pattern A and a point x. Addition of noise 
B — A within a neighbourhood U oi x results in a new pattern B. Axiom 4 says 
that we can make the distance between A and B smaller by making U smaller. 




Fig. 3. (a) Crack robustness (b) Noise robustness. 



Each of the previous axioms would still hold if we replaced the metric d with 
a bounded metric p{A,B) = min(c, d(A, i?)) for any fixed c > 0. The following 
axiom prohibits “cut-off” bounding of metrics in this manner. We say (X,V,d) 
is strictly increases if the following is true: 

Axiom 5. For any A,B £ V, a C £V exists such that d{A, C) > d{A, B). 

This axiom expresses a form of discrimination: Given two patterns, there should 
always be a third pattern such that the distance between the first and the third 
pattern is larger than the distance between the first and the second pattern. 

3 Examples 

In this section, we analyse examples of metric pattern spaces, using the proper- 
ties given in Section 2. In each example, we fix the space X and the collection of 
patterns V . Therefore, we will say that an axiom holds for a metric on d, when 
we actually mean that the axiom holds for the metric pattern space (A, 7^, d). In 
the following, we will discuss the discrete metric, the Hausdorff metric, the vol- 
ume of symmetric difference, the normalised volume of symmetric difference, the 
difference of normalised indicators. After that, we present the reflection metric. 
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3.1 Discrete Metric 



The following example, the discrete metric, illustrates that it is no problem 
finding a metric defined on any collection of patterns invariant under any group 
of homeomorphisms in a topological space X. Only demanding the satisfaction 
of axioms, makes the problem of finding “invariant” metrics non-trivial. 

Let the collection of patterns be p(IR"), the power set of IR". 

Definition 2. The discrete metric on p(IR") is given by 



du{A, B) 



0 if A= B 

1 otherwise 



(2) 



The discrete metric is invariant for all homeomorphisms. However, this metric 
satisfies none of our axioms. Below, we provide simple examples contradicting 
each of the axioms. 

Deformation robustness does not hold because non-empty open subsets of K” 
always contain more than one point. Choose A = { x }, where x is an arbitrary 
point in IR", and choose e = 1. Then, d{A,t{A)) < e if and only if t{x) = x. It 
can be shown that any open / containing the identity, contains a transformation 
t for which t{x) yf x, implying d{A,t{A)) = 1. 

The following example contradicts blur robustness. Choose A = {x} and 
e = 1. For any neighbourhood U of x, choose B = \ v,x\, where u yf x is in [/. 
ThenA^B sod:{A,B) = l. 

Using the unit sphere in IR^, a counterexample for crack robustness can 
be constructed. Choose H = S'^, and let R consist of a single point on S^. Clearly 
R C Cl(Bd(H) — R). Now let e = 1. For any open neighbourhood U of R, choose 
B = A — R, giving di){A, B) = 1. 

A counterexample for noise robustness is obtained by choosing A = { x } 
and e = 1. For any neighbourhood of U, we can always choose B = 0 giving 
du{A,B) = 1. 

Agreeing with intuition, the discrete metric is not strictly increasing. The 
distance between any two distinct patterns always equals one. 



3.2 Hausdorff Metric 

The Hausdorff metric is defined on the collection of (non-empty) bounded, closed 
subsets of any given metric space X. Here, we consider the Hausdorff distance for 
Euclidean space X = IR". The collection of patterns /C" consists of all non-empty 
compact subsets of IR". 

Let B{x, e) denote the open Euclidean ball in IR" centred at x G IR" with 
radius e > 0. For any subset S of IR", define the e-neighbourhood of S in IR" as 

5^=|jH(x,e). (3) 

xGS 

The following definition uses these e-neighbourhoods. 




292 Michiel Hagedoorn and Remco Veltkamp 



Definition 3. The Hausdorff metric on KT is given by 

dn{A, B) = inf{ e>Q\ACB^ andBCA^} . (4) 

It is not difficult to show that the invariance group for the Hausdorff metric 
equals Iso(IR"), the group of isometries in M". 

The Hausdorff metric is deformation robust. Let A G /C" and e > 0 be given. 
Cover A with a finite number of balls { H(ci, 5), . . . ,b(ck,S)j each having radius 
i5 = |e. Define an open set of transformations as a finite intersection of subbasis 
elements: 

k 

/= f|{teHom(]R”) |t(Cl(H(c„5)))CH(Q,2<5)} . (5) 

i=l 

Clearly, dn{A,t{A)) <26 < e for each t G I. 

We prove blur-robustness for the Hausdorff metric. Let A G /C" and e > 0. 
Choose 6 = ^ and U = Bd(H)'^. Suppose B G V satisfies B — U = A — U and 
Bd(H) C Bd(H). Since A C B^ and B C A^ , we have d-n{A,B) < 5 < e. 

The Hausdorff metric is crack robust. Let A C M” be non-empty compact 
and i? be a crack of A. Set 5 = Cover R with a finite collection of balls 
centred at points of R, and having radius 6\ {Bi,... ,5^}. For each ball Bi 
choose a point Xi in the intersection with Bd(H) — R. Construct an open set 
containing R by setting U = UtiS.- {xi, . . . ,Xk}- Then dn{A, B) <26 < e 
for any H G 5” satisfying B — U = A — U. 

A counterexample for noise robustness can be constructed in the real line. 
Choose A = {0},a;=l, and e = 1. For any neighbourhood [/ of 1, choosing 
i? = { 0, 1 } gives A — U = B — U and du{A, B) = e. 

The Hausdorff metric is strictly increasing. Let A,B G /C". Let e = d^i{A, B). 
Choose a point c G IR" such that infa,gA l|a: — c|| > e. Clearly dn{A,BLl{ c}) > e. 

3.3 Volume of Symmetric Difference 

Let V = { wq: • ■ • ) I'fe } be a pointwise independent subset of K”. The closed 
k-simplex determined by V is the set of points in the fc-dimensional hyper-plane 
containing V having nonnegative barycentric coordinates with respect to V. 
Two simplices are properly joined if their intersection is a subsimplex of both 
simplexes. Define 5" as the collection of non-empty subsets of M" that are C^- 
diffeomorphic with a finite union of properly joined n-simplices. 

For a compact set A C IR”, let vol(A) denote the volume of A which equals 
the Lebesgue-integral of the indicator 1^. 

Definition 4. The volume of symmetric difference on 5” is given by 

ds{A, B) = vol{{A - B) U {B - A)) . (6) 

The function c?s is obviously a semimetric. The fact that ds is a metric on 5” 
follows from the fact that Int(A) — H is non-empty and open if A is not a subset 
of B, and A, B G 5". 
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The collection of patterns 5” is closed under C^(IR”), the group of diffeo- 
morphisms on IR”. It can be shown that ds has invariance group UJ(IR”), the 
class of diffeomorphisms with unit magnitude Jacobi-determinant everywhere. 
These transformations preserve the volume of each set. The special affine trans- 
formations form a proper subgroup of UJ(IR") 

The collection of patterns 5" is defined such that for each A G S", an 
arbitrary small neighbourhood of the boundary of A can be chosen: For each 
e > 0, an open neighbourhood U of Bd(^) exists with vol(C7) < e. This property 
is essential in proving the deformation, blur and crack robustness axioms. 

The volume of symmetric difference is deformation robust. Let A € S" and 
e > 0. Choose a neighbourhood U of Bd(^) with volume strictly smaller than e. 
An open neighbourhood of the identity transformation is given by 

/= e C^(]R”) I t(Bd(A)) C [/} . (7) 

Suppose t G I. Since both A — t{A) and t{A) — A are contained in U, it follows 
that ds{A,t{A)) < vol(C7) < e. 

The symmetric difference is blur robust. For given A G 5" and e > 0, choose 
some open neighbourhood U of Bd(A) with volume smaller than e. Clearly, 
ds{A, B) < e for any B G satisfying B — U = A — U . 

We can also prove crack robustness by choosing neighbourhoods with suf- 
ficiently small volumes. For A G 5", crack R of A, and e > 0, choose an 
open neighbourhood U of R with volume less than e. Any B G 5" satisfying 
B — U = A — U gives ds(A, B) < e. 

Proving noise robustness is almost the same. Let A G S'^, x G K”, and e > 0. 
Choose a neighbourhood U oi x with volume smaller than e, then A—U = B — U 
implies ds(A, B) < e for all B G S^. 

We show that the volume of symmetric difference is strictly increasing. Let 
A,B G 5”. Let S' be a closed n-simplex, disjoint with ALIB. Then, ds{A, BLiS) > 
ds{A, B). 



3.4 Normalised Volume of Symmetric Difference 

Definition 5. The normalised volume of symmetric difference on S" is given 
by 



dNs(A, B) 



vol{{A - B) U {B - A)) 
vol(A U B) 



(8) 



The function dus is a metric with invariance group CJ(IR”), the class of dif- 
feomorphisms with constant magnitude Jacobi-determinant everywhere. These 
transformations preserve the ratio of volume for any two sets. The affine trans- 
formations Af(IR") form a proper subgroup of CJ(IR"). 

For proving deformation, blur, crack and noise robustness, it suffices to show 
that the topology induced by the metric ds is finer than that induced by that of 
dus. This is done in the following simple lemma. 
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Lemma 1. For each A G S'^ and each e > 0, there exists a S > 0 such that 
ds{A, B) < 6 implies fins (A, B) < e for all B G . 

We are left with showing that cIns is strictly increasing. For this purpose, the 
construction given for the volume of symmetric difference suffices. 

3.5 Difference of Normalised Indicators 

For A G S"^, define a function i/a ■ ^ IR by lyA(x) = vol(A)“^l^(a;). 

Definition 6. The difference of normalised indicators dm for 5" is given by 

dm{A,B)= / \irA{x) - iyB{x)\ dx . (9) 

d]R" 

The invariance group of dm is CJ(IR"). 

The following lemma is the analogue of Lemma 1 for the difference of nor- 
malised indicators. It shows that the topology dg induces on 5" is finer than 
that induced by dm. This immediately gives deformation, blur, crack, and noise 
robustness for dm. 

Lemma 2. For each A G S"' and each e > 0, there exists o d > 0 such that 
ds{A, B) < 6 implies dm(2l, B) < e for all B G . 

This leaves us to show that the normalised indicator difference is strictly 
increasing. For any pair of patterns A, B G 5”, we can add a closed n-simplex S 
to B, disjoint with ALIB, resulting in a set C G 5", with dm(A, C) > dNI{A, B). 

3.6 Reflection Metric 

Let 7^" be the collection of subsets of M” (not contained in any (n — 1)- 
dimensional hyper-plane) that are C'^-diffeomorphic to a properly joined union 
of closed (n — l)-simplices. Formally, we write each pattern A G TiA as A = 
4>{yJi=i Si), where S\,. . . , Sk are properly joined closed (n — l)-simplices and 
0G CHK”). 

We use the notation xy for the open line segment connecting two distinct 
points x,y G IR". We say that a point y G IR” is visible (in A) from a point 
X G IR" if AC\Ty = 0. For A G TZn and x G IR", the visibility star Rf is defined 
as the set of open line segments connecting points of A that are visible from x: 

Vff = y_j{xd\ a G A and AOWi = & } . (10) 

We define the reflection star Ra by intersecting with its reflection in x: 

= {x + v gM^ \ X - G and a; -k u G } . (11) 

Each pattern A G RA determines a function pA ■ K" ^ IR given by pa{x) = 

vol(i?^). Note that pA is zero outside the convex hull of A. Figure 4 shows the 
visibility star Ef and the corresponding reflection star R\ for a pattern A G R^, 
and a point x G IR^. 
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Fig. 4. Reflected visibility. 



Definition 7. The reflection metric for A,Bg 7^" is given by 

\pa(x) - pb(x)\ dx 

dR[A,B) — j — . ( 12 ) 

J^^Ttiax[pA[x),pB[x)) dx 

The reflection metric is invariant under the affine transformations Af(IR"). 
Observe that for any two patterns A,Bg 7^" and any point x € IR”: 

vol(R^ -R%)<2 vol(V2 - VS) . (13) 

),From this, we And that 

\pa{x) - pb{x)\ < 2 vol((FI - V§) U {VS - FD) • (14) 

Thus, we can prove the first four axioms by bounding the change in the visibility 
star for deformation, blur, crack and noise. 

The metric d^ is deformation robust. Let A G 7^”, where A = <('(U^=i ^i)- Let 
Qi,. . . ,Qihe the set of (n — 2)-subsimplices of the (n — l)-simplices . . . , Sk- 
Let Ui be an open neighbourhood of A with volume strictly smaller than a given 
number (5i > 0. Let L = <()(lj-^i Qi) and let U 2 be an open neighbourhood of L 
with volume smaller than a given number (52 > 0. Define an open neighbourhood 
of the identity: 



I={tG C^(]R”) I t{A) C Ui and t{L) CU2 } ■ (15) 

Given any e > 0, we apply Eq. 14, in choosing (5i > (52 > 0 small enough so that 
dR{A, t{A)) < e. 

The reflection metric is blur robust. Let A G TiA and e > 0 be given. Choose 
an open neighbourhood U oi A with vol(C/) < 6 for some given 5 > 0. Using 
Eq. 14, choose (5 > 0 small enough so that for all B G TiA satisfying B—U = A—U 
and A C B, the distance c?r(21, B) is smaller than e. 

The reflection metric is crack robust. Let A G 7^", i? be a crack of A, and 
e > 0. By means of Eq. 14, we can choose a sufficiently small open neighbourhood 
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U of the crack R such that B) < e for patterns B G 7^” satisfying B — U = 
A-U. 

The reflection metric is noise robust. Let A G 7^” and x G K” be given. 
Using Eq. 14, we can choose an open neighbourhood U of x small enough such 
that dR(Gl, B) < 5 for all B G RA satisfying B — U = A — U. 

Finally, we show that (7 r is strictly increasing. Let A,Bg 7^”. Let S be an 
n-simplex disjoint with A, having volume strictly larger than the convex hull of 
A\J B. The boundary C = Bd(5') is a properly joined union of (n — l)-simplices 
and is therefore a member of 7^". It is not difficult to see that d^{A,C) > 
dR{A, B). 



4 Conclusion 

Given a collection of patterns V, and a group of transformations G, finding a 
metric d on V invariant under G is trivial; we can always choose the discrete 
metric. The real problem is finding robust metrics for a class of patterns and an 
invariance group. Until now, robust affine invariant pattern metrics were only 
known for solid sets or simple closed curves. In this paper, we introduced a robust 
metric, called the reflection metric, which is invariant under affine transforma- 
tions, and is defined on both solid and non-solid boundaries. 

We have formalised our notion of resemblance using five axioms. The axioms 
can be verified for any metric defined on a collection of subsets of a topological 
space. Axioms 1-4 express robustness for deformation, blur, cracks and noise, 
respectively. The robustness axioms guarantee that a pattern metric is well- 
behaved with respect to discretisation of patterns. Axiom 5 is complementary 
by enforcing discriminative power on the metric. 

We have used the axioms as a guiding line in analysing various metric pattern 
spaces. We discussed the discrete metric, the Hausdorff metric, the volume of 
symmetric difference and a normalised variant, the difference of normalised in- 
dicators, and the new reflection metric. Table 1 shows for collection of patterns, 
metrics, invariance groups, and satisfaction of axioms for each of the metric 
pattern spaces considered in this paper. 

The reflection metric is defined on the class of finite unions of hyper-surfaces, 
is affine invariant, and satisfies all our axioms. This makes the reflection metric 
especially suitable for matching patterns obtained using image processing tech- 
niques. The reflection metric between two finite segment unions, can be com- 
puted in 0((s -I- k) log(s + k) + v) time, where s is the total number of segments, 
k is the total number of edges in both visibility graphs, and v the number of 
vertices in the reflection-visibility arrangement (at most 0{s^ + fc^)), see [9]. 
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Abstract. The problem of matching two images of the same objects 
but after movements or slight deformations arises in medical imaging, 
but also in the microscopic analysis of physical or biological structures. 
We present a new matching strategy consisting of two steps. We consider 
the grey level function (modulo a normalization) as a probability density 
function. First, we apply a density based clustering method in order to 
obtain a tree which classifies the points on which the grey level function 
is dehned. Secondly, we use the identification of the hierarchical repre- 
sentations of the two images to guide the image matching or to define a 
distance between the images for object recognition. The transformation 
invariance properties of the representations allow to extract invariant im- 
age points. Using the identification of the trees, they allow, in addition, 
to find the correspondence between invariant points even if these have 
moved locally. Then, we obtain the transformation function as the thin 
plate interpolation of the corresponding point pairs. On the other hand, 
if we use tree identification, this enables us to propose several criterias 
to distinguish between real deformations and noise effects. In practice, 
we treat, for instance, hrst coarse trees (with few leaves) and pass to 
ever refining trees, after. The method’s results on real images will be 
discussed. 



1 Introduction 

The use of structural information in images for recognition tasks has been real- 
ized since the inception of machine vision ([22], [23], [1]). A graph structure allows 
us to represent the extracted information. This structure is useful to represent 
information about topology and shape of and between the extracted features 
and therefore to compare two images, but also to guide the process of matching 
corresponding features coming from two different images. In [10] for instance, 
an algorithm for matching Delaunay graphs obtained from selected points of 
segmented lines has been proposed for matching satellite images. If we want to 
match images in which local movements appear (e.g., rigid transformations of 
individual objects in the image or images of deformable objects) a structural 
representation can also, as will be shown, be used to follow real deformations or 
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movements of objects and to distinguish them from noise effects. For this, we 
have to construct the graph (a tree in our case) in such a way that its structure 
is kept even if deformations appear or features have moved. 

A grey level function g : K” — > IR>o,n = 2 or n = 3, with compact support 
I, associates to each physical point in the image scene (represented by IR”) a 
value (g IN in the technical realization) corresponding to its physical properties 
and will be considered as being given by the acquisition device, g is eventually 
smoothed and normalized, in order to have a continuous (or even derivable) 
density function. 

The presented approaches apply density based classification methods ([29], 
[14], [15], [26]) to the grey level function in order to construct trees. These meth- 
ods confine points in clusters at several levels if the points are in regions of 
(respective to the level) high density. We will call the resulting tree (section 2) 
a confinement (or density) tree representation of the image. If we choose the 
classification method proposed in [29] and [15] the resulting tree corresponds to 
the component tree defined in [13]. Under reasonable conditions, we will demon- 
strate that the presented representation is invariant with respect to an arbitrary 
topological transformation (section 2). 

Matching is done now by identification of the two density trees corresponding 
to the two images. The method given therefore tries to associate the leaves 
of the trees and is based on the (intra-) tree distance (defined in section 4), 
which is robust with respect to instabilities in the trees. The instabilities arise 
as we transform a continuous function (the grey level function) into a discrete 
structure. Little noise can result in different tree structures (section 3). 

The idea behind this tree representation matching strategy is that the den- 
sity tree will adapt itself to global and local movements in the image (Figure 
1). This allows us to define characteristic points {candidates for landmarks [4]) 
and to find corresponding point (i.e., landmark) pairs, associated to nodes in the 
tree, even if local movements appear. Moreover it allows us to detect if in one of 
the images a cluster - intermediary node or leave in the tree - has been changed 
by noise effects (section 4). All this has a series of applications as detailed in 
section 5. 

The main originality of our research work lies in two points: 

~ To apply hierarchical classification of the points on which the grey level 
function is defined for image matching, to found this strategy on a topological 
basis, and to define a distance between two images based on their density 
tree representation for object recognition. 

— To propose a tree identification algorithm which respects local instabilities 
of the tree at all levels. The tree identification problem arises also in other 
domains (Shasha et al. [25]). 

In [21] we have already described these principles of the strategy, here we will 
detail the algorithm and show how to obtain practical results. We will remark 
that Leu and Huang [18] used a comparison of ordered trees to define a distance 
between objects for object recognition. However, a tree identification method 
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able to handle instabilities in the tree is not proposed and the algorithm’s suit- 
ability to match images where non-affine deformations appear is not investigated. 
Moreover, the tree construction approach does not use statistical methods, but is 
of geometrical nature based on the closed^ contour in a segmented image. A tree 
matching approach for signal processing has been investigated in [7]. Indepen- 
dently from our work, Kok-Wiles et al. presented in [16] a multi-scale matching 
approach integrating a hierarchical representation of the image. 

A recent general overview of matching algorithms can be found in [20]. 

The remainder of the paper is organized as follows. In section 2, we detail the 
construction of the confinement tree and the invariance theorem associated to 
it. Using the confinement trees of two images we define a distance between them 
in section 3. In section 4, we show how to apply the hierarchical representations 
for image matching and present some results on real images in section 5 before 
concluding in section 6. 



2 Construction and Invariance Properties of a Density 
Tree 

Confronted with the classification task based on a density function we were led 
to a mathematical entity denoted confiner, appearing in a natural way when 
we look for stochastic equivalents of attractors in [8]. Given a density function 
g : IR” ^ K> 0 i n G IM, the confiners are defined as the maximal connected sub- 
sets (or components, Gaal [11], p. 105) Ci of the level sets Li = {x G IR"|(/(x) > 
l},l G IR>o. In the classification domain we found these components first in 
a contribution by Wishart [29] and further investigations by Hartigan [14,15], 
where they are called high density clusters. Considering them taken on several 
levels lk,k = l,...,r (r from resolution) including the 0 level, they define obvi- 
ously a tree (by “set inclusion”) as illustrated in Figure 1. The tree is finite if g 
has a finite number of extremas. We call this tree confinement (or density) tree 
and if g is a grey level function (n = 2, 3) of an Image Ig we use the term con- 
finement tree representation of Ig. The confiners taken at level Ik are the nodes 
(intermediary nodes and leaves) of the tree at level k. We have chosen this clas- 
sification method because of the invariance theorem and its consequences shown 
below. Curiously, in the case of point distributions, a corresponding property 
has already been mentioned in the contributions of Wishart [29] and Hartigan 
[14,15] (without assuming the paradigm to hold and only in the case of an affine 
transformation) even if they did not address a matching problem. 

In practice, we take all grey levels of the image into account and we delete 
confiners with a mass (which we calculate as the sum of all normalized grey 
values associated to the pixels in the confiner) less than a factor A of the mass of 
all confiners at this level. We call A the lop-parameter of the tree representation. 
We choose the discrete d4— distance for defining the connectivity between pixels. 

^ Simply edge detection [6] is not sufficient. 
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Fig. 1. Illustration of the main advantage and the corresponding tree representation. 



The time to calculate the tree is 0(rn), where n is the number of pixels and r 
is the number of levels. 

The following hypothesis, which we formulate first in a strong version, is the 
foundation of the transformation invariance property of our method. In a less 
rigorous framework we can find similar hypotheses in Optical Flow research [3] . 

Medical Imaging Paradigm. Let (j) : A ^ B he the transformation ex- 
pressing movements and deformations of objects in the images A and B which 
appear when passing from A to B, the density associated to image A, fs 
that associated to image B, and X an image point of the image A. Then we can 
assume fA{X) = fB{4>{X)) (apart from local noise effects). 

Let us remark that: (1) for the paradigma to hold we should consider images 
covering the whole space in which movements appear (thus, 3D images in gen- 
eral) and that the paradigma is a realistic assumption in a larger context than in 
medical imaging but not in general; (2) if for two images A and B the acquisition 
device (or its location) has changed we have to assume fA{X) = afB{4>{X)) P, 
where a and /? can be calculated a priori. The following theorem still holds under 
this assuption. 

Invariance Theorem. Let A, B, fA, fB, o,nd (j) he defined as in the Paradigma. 
Then: if (j) is a topological application, then, apart from noise effects, the density 
trees (constructed as proposed above) of images A and B are identical, moreover, 
the confiners of A are transformed into the confiners of B. 
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According to the definition (Gaal [11], pp. 186, 187) a topological transfor- 
mation is bijective and bicontinuous (i.e., </> and (f>~^ are both continuous). As 
the image of a maximal connex set under a topological application stays a max- 
imal connex set, (j) induces, using the paradigm, a bijection between the nodes 
of the two trees; this proves the theorem (see [21]). 

The importance of the property lies on the fact that the tree is invariant 
referred to local movements. The effect of noise will be detailed below. 

We will remark that an octree [24], for instance, would not have this invari- 
ance property. However, in the next section we will be confronted with a problem 
associated to this approach. 



3 Image Distance and Noise Sensitivity 

Algorithms used to compare trees have been developped according to their 
domain of application. In hierarchical clustering and taxonomy Boorman and 
Olivier [5] and Farris [9] compared the shapes of two taxonomic trees. These 
trees have labeled leaves called OTUs which are the same in both trees. Boor- 
man and Olivier decomposed the trees in terms of simpler structures for which 
adequate metrics are available. These simpler metrics are then used to induce a 
distance on trees. Farris referred to the number of nodes on the path between the 
OTUs as the cladistic difference between the OTUs and compared the differences 
corresponding to the same OTUs. Using the idea of language transformation Lu 
[19] derived one tree from the other by a series of elementary transformations, 
with applications in pattern recognition. The distance between the two ordered 
trees is the minimum-cost sequence of transformations. Zhang et al. [30] ex- 
tended this distance definition to unordered trees, this makes its calculation a 
NP-complete problem (Zhang et al. [30]). Shasha et al. [25] proposed an enu- 
merative algorithm and some heuristics for the distance calculation. Restricting 
the allowed tree transformations Tanaka [27] achieved a polynomial matching 
algorithm. Our inter-tree distance is an extension of Farris’s comparison of the 
cladistic differences. 

Let us denote the two trees to compare by A and B. Our distance defini- 
tion is based on the (intra-) tree distance dT associated to a tree T {T = A or 
T = B), which is defined for each couple of nodes (L1,L2) by the sum of the 
weights (for the definition of the weights see section 5) associated to the edges in 
the path which relates the nodes in the tree (Barthelemy and Guenoche [2]). Let 
Nodes(A) (resp. Leaves(A)) denote the set of nodes (resp. leaves) of the tree A. 

A mapping a from Leaves(A) into Nodes(R) is represented by a set of couples 
a = {{Af , Bi)i<i<M , Af G Leaves(A), Bi G Nodes(R)}, 

(M denotes the number of leaves in A; Bi = Bj for i yf j is allowed). The cost 

C(a) of such a mapping is 

C(a) = E ldB(B,,B,)-dA(Af,Af)l. 

Let A4(A, B) denote the set of all possible mappings from Leaves(A) into Nodes(R). 
The inter-tree distance is defined by: 
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d{A,B) 



m'^-m 



min C(a) 

aGM{A,B) 



1 



min C(b), 
beM(B,A) 



( 1 ) 



where M (resp. M') denotes the number of leaves in A (resp. B). 



A problem concerning the invariance of the trees occurs, if more than two 
confiners merge together at the same level approximately (see Figure 2). If, due 
to noise, the order of fusion changes, we will have an exchange between the 
confiner subtrees. Our approach to deal with this is the use of the (intra) tree 
distance in the definition of the inter-tree distance above. We call this problem 
the problem of simultaneous fusion. Figure 3 illustrates the robustness of the 
use of our inter-tree distance d{A, B). 





Fig. 2. Illustration of the problem of simultaneous fusion. 




Fig. 3. Example for weighted trees A and B (each edge is weighted by a certain num- 
ber of I’s) with a subtree exchange and illustration of the robustness of the intra-tree 
distance: for instance |dA(l,5) — dB{a,h)\ = |16 — 18| < |ciA(l, 5) — dB{a,e)\ = 4. 
If Aj: = Aj:^ = 5 and Bkg = r{B), we have = A,A](_^ = 2,Aj(^ = 9 and we get 
bestfit^Bc^) = 3 for the leaves h,b,i and bestfit(Bci) = 7 for the leaves d,f,b. 
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4 Matching Algorithm 

Here we present an algorithm that we have designed to identify the two resulting 
trees. Its interest lies in its efficiency in computation time and in its robustness 
with respect to small differences between the two trees due to noise. The central 
theme behind our algorithm can be seen as the association of the leaves of A 
to the nodes of B such that the minimum 'aAn.a^M(A,B) C{a) in (1) is achieved, 
or reciprocally for minf,g_v((B^yi) C'(&). In order to minimize the cost function we 
could apply a nonlinear optimization method as graduated assignement [12]. 
This is currently under research. The here presented algorithm makes use of 
the special characteristics of the problem such as the possibility to integrate 
geometrical information. First, let us introduce some notations. 

With subtree(N) , where is a node, we denote the subtree rooted in N. 
We will also use the notations father{N), confiner(N), and r(T) = root{T) in 
the evident sense, child{N) for the set of all children of node N, mass{N) as 
the sum over all weights in subtree(N) , and gx {T = A or T = B) for the grey 
level function associated to T. 

In the following algorithm we try to associate each leaf (or a certain number 
of selected leaves, see section 5) A^ of H to a node Bk of B by moving the leaf 
Aj^ downwords in the tree B. 

Initialization : 

We initialize the leaves of A on the root of B. 

Bifurcation : 

If is associated to node Bkg we are looking for the child of B^q which minimizes 
the bestfit value which we are defining now: 

Ako := root of the subtree containing and minimizing \mass{Bko) — mass{Ai)\, 
l<i< M- 
A f' — A f' • 

Ak^ := leaf in subtree{Akg) with largest distance from Ak\ 

A^^ := leaf in subtree{Ak^) which maximizes the sum of its distances to Ak and Ak_^. 
For each Bk^ € child{Bko) do: 

bestfit(Bk^) := minimum of the sum below while minimizing over all triplets 

{Bk^, Bk^, Bk^), with leaf Bkg in subtree{Bk^) and leaves Bk^,Bj^^ in B: 

2 

^ \dA{At„Aj:.) - dB{Bf:^,Bt^)\ + \dA(Al,riA)) - dB{B^^,r{B))\. (2) 

0<i<j<2 i=0 

Return the child Bk^^^^ € child^Bkg) realizing the min of the bestfit among all 
those childs. 

If the bifurcation corresponds to a subtree with just two leaves, we use a geometrical 
criterion easy to define (based on the barycenters of the concerned confiners). 

We will remark, as we are minimizing over all leaves B ^_^ , Bj^^ in the whole 
tree B to define bestfit, that it does not care if subtree(Akg) does not correspond 
exactly to subtree(Bkg) 
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If we fall short of a certain level of resolution the influence of noise can 
become important. In our practical applications this was only the case at the 
last bifurcations of some of the branches (section 5). Therefore we could use 
the geometrical criterion based on the barycenters of the concerned confiners 
for the decision at the last bifurcation (see Figure 5,6). For instance, we im- 
pose a limit for the distortion of the transformation function (visualized by the 
grid in Figure 6). Or we set bounds for the values relPos{Ni, N2,4>) (eventu- 
ally normalized by min(relPos{Ni, iVi, id), relPos{N 2 , N 2 , id))), where we define 
relPos{Ni, N 2 ,(p), the relative position between of A and N 2 of B, as the 
normalized sum 

{9a{P\) + 9B{(l>{P2)))d{Pi,(l){P2)) 

Pi6c7p2eC2 mass{Ni) + mass{N 2 ) 

where C\ = confiner{Ni),C 2 = confiner{N 2 ), and where d{P\,P 2 ) is the eu- 
clidian distance between the points (pixels) Pi and P 2 - Another way to define 
a stop criterion is to demand that, if is the second best choice after 

^ibestJ that: bestfit{Bi^^^^^^) > 2bestfit. A third way consist in taking the leaves 
Bq, B^ , B 2 for which the minimum for best fit (in equation (2)) is achieved and 
search for them the leafs in subtree(Akg) realizing the minimum in equation (2) 
written for Bq, Bf, and B 2 as initial leaves. If the leaf corresponding to Bq is 
not in the same subtree of subtree{Akg) as Aj^^ we stop and remove it if other 
selected leaves A^, are in subtree(Akg) ■ If not, we associate A^g and B^g- 

Another way to identify the two trees is based on the lop parameter A (section 
2). We lop A and B with lop parameter A1 such that just 5 leaves rest in A 
and and we associate the leaves in A with the nodes in B minimizing the cost 
function. We remove outliers from the mapping. After, we choose A2 such that 
6 leaves appear in A and associate the new leaf, minimizing the cost function, 
where we fix the previous mapping, and so on (Figure 7). 

5 Applications 

Applications of our method include the study of the relative movement of grains 
in microscopic metallurgical images or of cells in microscopic biological images. 
On the other hand, we can also employ our method to detect asymetries in 
breast or brain images, or to detect where in an image movements occur. In the 
first case this serves to detect a tumor and its location. In the second case the 
method helps us to apply watershed, optical flow or active contour techniques at 
the found place . If we just want to rediscover the affine transformation between 
two succesive 2D slides of a 3D image, we apply a least square minimization of 
the euclidean distance between the corresponding invariant characteristic points 
found by our identification algorithm. An alternative to this is to proceed as 
Thirion [28] for matching extremal points and just to define the characteristic 
points with the help of the trees. 

A further interesting application of the proposed method is the comparison 
of two brain images resulting from functional MRI. Here occur small local defor- 
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mations as well as global shifts. The aim is to locate positions where the brain 
activation (and therefore the intensity of the functional signal) changed. 

The Figure 4 shows the tree representations of the images and the selected 
nodes of the trees. The selected nodes are the nodes in the middle of the sub- 
branches presenting the largest number of tree levels between two bifurcations 
and having no subbranch with the same property as successor. Figure 5 shows 
the new position in the lower tree of Figure 4 of the selected nodes of the upper 
tree after the application of our matching algorithm which we simplified. We 
get the best results (Figure 5) when we weight each edge of the tree just by 
a constant such that both trees have the same total weight. This results from 
the fact that the changes in grey intensity are larger than those concerning the 
topological structure, especially in the upper right part of the brain, where a tu- 
mor is located. However, for detecting the tumor or its location after matching, 
weights constructed from the intensity values contain useful information. 




Fig. 4. Top left: Initial image of the upper slide of the brain image; middle: correspond- 
ing density tree; top right: highlighted confiners as marked by the arrows in the tree; 
bottom: the same as above for lower slide of the brain image. 
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Fig. 5. Initial image of the lower slide of the brain image on the left, confiners found 
after tree identification marked by arrows in the corresponding density tree in the mid- 
dle, and highlighted confiners as marked by the arrows in the tree on the right. Outlier 
2 can he removed with one of the stop criteria. 




Fig. 6. Initial image of the upper slide of the brain image with initial grid on the left, 
image of the lower slide and transformed grid after thin plate spline interpolation based 
on the confiners in the figure before except confiner 2 in the middle, and transformed 
grid based on the same confiners except confiner 2 and confiner 3 on the right. 




Fig. 7. From left to right: (1) Image of the lower slide of the brain image and highlighted 
confiners as marked by the arrows in the tree next to it, (2) tree A loped such that just 
5 leaves occur, image of the upper slide with highlighted confiners as marked by the 
arrows in the tree on its right, (4) tree B loped with the same lop parameter as tree A. 
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6 Conclusion 

We showed the usefulness of a tree representation of a density or grey level 
function known in statistics {density or confinement tree) and in image analysis 
{component tree). We based this on a topological invariance theorem we could 
establish under reasonable (in practice) hypothesis, which justifies the use of 
this tree for image matching and object recognition. In order to get practical 
results we have presented a tree identification algorithm and several criteria for 
detecting automatically the level at which the resolution becomes noise sensitive. 

References 

1. Barrow, H.G., Poppelestone, R.J.: Relational Descriptions in Picture Processing. 
Machine Intelligence 6 (1971) 377-396 

2. Barthelemy, J.-P., Guenoche, A.: Les arbres et les representations des proximites. 
Masson, Paris, (1988) 

3. Black, M. Jepson, A.D.: Estimating optical flow in segmented images using variable- 
order parametric models with local deformations. IEEE Trans. PAMI 18 (1996) 

4. Bookstein, E.: Principal wraps: Thin-plate splines and the decomposition of defor- 
mations. IEEE Trans. PAMI 11 (1989) 567-585 

5. Boorman, S.A., Olivier, D.C.iMetrics on spaces of finite trees. J. Math. Psych. 10 
(1973) 

6. J. Ganny: A computional approach to edge detection. IEEE Trans. PAMI 8 (1986) 
679-698 

7. Gheng, Y.C., Lu, S.Y.: Waveform correlation by tree matching. IEEE Trans. PAMI 
7 (1985) 299-305 

8. Demongeot, J., Gosnard, M., Jacob, G.: Attractors and confiners: Deterministic and 
stochastic approaches. In: Diner, S., Fargue, D., Lochak, G. (eds.): Dynamical systems 
- A renewal of mechanism. Centennial of Gorge David Birkhoff. World Scientific, 
Singapore (1986) 

9. Farris, J.S.: A successive approximations approach to character weighting. J. Syst. 
Zool. 18 (1969) 374-385 

10. Finch, A.M., Wilson R.C., Hancock, E.R.: Matching Delaunay graphs. Pattern 
Recognition 30 (1997) 123-140 

11. Gaal, S.A.: Point set topology. Academic Press, New York, (1964) 

12. Gold, S., Rangarajan, A.: A graduated assignement algorithm for graph matching. 
IEEE Trans. PAMI 18 (1996) 377-388 

13. Hanusse, P., Guillataud, P.: Semantique des images par analyse dendronique. In: 
AFECT, 8th Conf. Reconnaissance des Formes et Intelligence Artificielle, Vol. 2. 
Lyon (1992) 577-588 

14. Hartigan, J.A.: Clustering algorithms. Wiley, New York, (1975) 

15. Hartigan, J.A.: Statistical theory in clustering. J. of Classification 2 (1985) 63-76 

16. Kok- Wiles, S.L., Brady, J.M., Highnam, R.: Comparing mammogram pairs for the 
detection of lesions. In: Karssemeijer, N. (ed.): 4th International Workshop of Digital 
Mammography, Nijmegen, Netherlands, June 1998. Kluwer (1998) 

17. Lavallee, S., Szeliski, R.: Recovering the position and orientation of free-form ob- 
jects from image contours using 3D distance maps. IEEE Trans. PAMI 17 (1995) 
378-390 




Tree Representation for Image Matching and Object Recognition 



309 



18. Leu, J.G., Huang, I.N.: Planar shape matching based on binary tree shape repre- 
sentation. Pattern Recognition 21 (1988) 607-622 

19. Lu, S.-Y.: A tree-to-tree distance and its application to cluster analysis. IEEE 
Trans. PAMI 1 (1979) 210-224 

20. Maintz, J.B.A., Viergever, M.A.: A survey of medical image registration. Medical 
Image Analysis 2 (1998) 1-36 

21. Mattes, J., Demongeot, J.: Dynamic Quantization, Classification, and Imaging. 
In: 22nd Annual Conference of the Cesellschaft fiir Klassifikation e.V., Dresden, Ger- 
many, March 1998. Studies in Classification, Data Analysis, and Knowledge Organi- 
zation. Springer- Verlag, Berlin Heidelberg New York (to appear) 

22. Pavlidis, T.: Analysis of set patterns. Pattern Recognition 1 (1968) 

23. Rastal, J.S.: Graph family matching. Research Memorandum, MIP-R-62. Departe- 
ment of Machine Intelligence and Perception Edinburgh (1969) 

24. Samet, H.: The design and analysis of spatial data structures. Addison Wesley 
Publishing Company, Reading, Massachusetts (1990) 

25. Shasha, D., Wang, J.T.-L., Zhang, K., Shih, F.Y.: Exact and approximate algo- 
rithms for unordered tree matching. IEEE Trans. Syst., Man, Cybern. 24 (1994) 
668-678 

26. Silverman, B.W.: Density estimation for statistics and data analysis. Chapman 
and Hall, London (1986) 

27. Tanaka, E.: A metric between unrroted adn unordered trees and its bottom-up 
computing method. IEEE Trans. PAMI 16 (1994) 1233-1238 

28. Thirion, J.-P.: New feature points based on geometric invariants for 3D image 
registration. International Journal of Computer Vision 18 (1996) 121-137 

29. Wishart, D.: Mode analysis: A generalization of the nearest neighbor which reduces 
chaining effects. In: Cole, A. J. (Ed.): Numerical Taxonomy. Academic Press, London 
(1969) 282-319 

30. Zhang, K., Statman, R., Shasha, D.: On the editing distance between unordered 
labeled trees. Information Processing Letters 42 (1992) 133-139 




Ultra-Fast Skeleton Based on an Isotropic Fully 
Parallel Algorithm 



Antoine Manzanera^^^, Thierry M. Bernard^, Frangoise Preteux^, and Bernard 

Longuet^ 

^ Aerospatiale E/SCS/V, 2 Rue Beranger, 92320 Chatillon Cedex, France 
Bernard. LonguetOmissiles . aerospatiale . fr 
^ ENSTA/LEI, 32 Bd Victor, 75015 Paris, France 
Thierry . BernardSenst a . f r 

® CTA/GIP, 16bis Avenue Prieur de la Cote D’Or, 94114 Arcueil Cedex, France 
Antoine .Manzanera@etca.fr 

INT/Signal et Image, 9 Rue Charles Fourier, 91011 Evry Cedex, France 
Francoise .Preteux@sim. int-evry . f r 



Abstract. In this paper we introduce a new thinning algorithm, called 
MB, which is optimized with respect to the total number of elemen- 
tary Boolean operators needed to perform it. We first emphasize the 
sound foundations of the algorithm, which is built by expressing into 
the Boolean language the three following constraints: (1) homotopy, (2) 
median axis and (3) isotropy. The MB algorithm benefits from both 
novel algorithmic ideas and systematic logic minimization. By hunting 
down any redundancy in the expressions of topological/geometrical fea- 
tures, we achieve a procedure that is: firstly, dramatically low-cost, as 
it is completely computed in 18 Boolean binary operators per iteration, 
and secondly, fully parallel, or one-single-pass, which guarantees that the 
number of iterations equals half the biggest object thickness. 

Keywords: Thinning - Skeleton - Fully Parallel Algorithm - Isotropy - 
Median Axis - Discrete Topology - Logic Minimization - Boolean Com- 
plexity. 



1 Introduction 

The process that consists in peeling iteratively a black object on a white back- 
ground while preserving its topological properties has given rise to intensive 
international research activities for the last forty years with hundreds of pub- 
lished papers. The importance of such an investigation is due to the interest in 
digital skeletons (or image thinning), as a fundamental pre-processing step in 
pattern recognition and image analysis. Its duration is due to the fact that no 
definitely satisfying solution has been found. 

Properties required for a skeleton can vary slightly according to the appli- 
cations under study. Let us mention the most usual ones, ranked by order of 
importance: 
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— Homotopy: the skeleton must preserve the topology of the original image. 

— Median axis: the skeleton must contain the centres of maximal balls. 

— Isotropy: skeletonization and object rotation must commute, at least for 
rotation angles multiples of 7 t/2. 

— One-pixel Thickness: speaks for itself. 

— Noise Immunity: the skeleton must be fairly unsensitive to noise on the 
image (boundary pixels added or removed) . 

It is well-known, however, that theses requirements cannot be satisfied si- 
multaneously, as some of them are contradictory. In particular. Noise Immunity 
is incompatible with keeping the whole median axis in the skeleton. Likewise, 
One-pixel Thickness is incompatible with Isotropy, but can be obtained by a 
low-cost post-processing step on two-pixel-thick lines. So we decide to restrict 
ourselves to the three former constraints, as they seem the most fundamental 
ones. 

Thinning algorithms which are both iterative and local may be either se- 
quential or parallel. In this paper, we consider only the parallel framework. We 
propose a dramatically fast thinning parallel algorithm called MB (after the 
names of its two creators). Indeed MB proves to be faster than all other algo- 
rithms we know of, in a fundamental sense: for us, the fastest possible algorithm 
is the one that minimizes the overall computational effort required to achieve 
the desired image transformation. More precisely, whereas the cost of parallel 
algorithms is often defined as the number of iterations before convergence [5], 
[4], we rather define it as the number of elementary operations performed per 
pixel during the complete execution of the algorithm. 

The paper is organized as follows. In the next section, we simply present 
the MB algorithm as a recipe, and show some experimental comparative results. 
Section 3 introduces the complexity measure we use to quantify the compu- 
tational cost of the MB algorithm and its competitors. Section 4 details the 
different issues of parallel thinning (according to the previously chosen criteria) 
and explains how the MB algorithm deals with them. 

2 The MB algorithm 

Let us first introduce the formalism and related notations used in this paper. 
Here, is the discrete plane, represented by a square mesh. A binary image X 
is a subset of Considering X, a point p of or pixel is characterized by its 
binary value x = lx{p), where lx is the characteristic function of the set X. The 
convention used to designate the value of an 8-neighbor is shown on Figure 1. 
The MB procedure appears in Table 1, including some hint comments to get the 
spirit of the algorithm at once (before reading Section 4) . Computations therein 
only involve Boolean operators (i.e. Boolean functions of two binary variables). 
Four of them are actually used, with the following notations: symbol A stands 
for the conjunction (AND), symbol V for the disjunction (OR), symbol \ for the 
exception (AND NOT) and symbol 0 for the inequality (XOR). The procedure 




Ultra-Fast Skeleton Based on an Isotropic Fully Parallel Algorithm 315 




Fig. 1. Pixels valnes in the 8-neighborhood. 



Table 1. The MB procedure. 



The following sequence is repeated for all pixels within a parallel framework, until 

stability: 

(1) a = xn a xe 

(2) a — a A asw A x 

1 1 A = eroded set of X 

(3) b = {as \ Xn) V (ajv \ xs) V {ue \ xw) V {aw \ xe) 

I j 'Q = {pixels between a point in A and a point in X } 

(4) c = x®xe 

(5) c = c A cn A {x (B Xn) 

(6) c = c V cw 

(7) c = cV cs 

// C = {pixels that belong to pattern (/?) of section 4.3 } 

{8) X = X \ {b\ c) 

II removed pixels are those that belong to B but not to C 



uses intermediate pixel binary variables o, b and c that actually correspond to 
binary images A, B and C. Figure 2 shows some results of the MB algorithm. On 
these various examples, the three criteria we have selected in the introduction 
are met. The two-pixel-thick segments that appear in the produced skeleton are 
a natural outcome of the isotropy of the MB procedure. Anyway, there exist 
arbitrary thick objects that must not be thinned, such as the one shown in the 
top right corner of the image, which is indeed preserved by the MB algorithm. 
Table 2 outlines the main characteristics of the MB algorithm, to be detailed in 
the next sections. 



Table 2. Main characteristics of the MB algorithm 



— Fully Parallel thinning algorithm. 

— Number of iterations before convergence equals half the maximal thickness. 

— 18 elementary Boolean operations per iteration. 
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3 Complexity measure 

Before going further into the MB algorithm, let us consider its complexity. Coun- 
ting the number of symbols A, V, 0 and \ in the MB procedure (cf. Table 1), leads 
to assign a computational cost of 18 to each iteration. This measure represents 
the number of two-input logic components that are needed for the computation, 
no matter if it is at the software or hardware level. This measure has actually 
been used since the early times of logic synthesis to define the complexity of 
Boolean functions, as recalled hereunder [17]: 

Definition 1. Let f : {0, 1}” — > {0, 1} be a Boolean function of n variables. 
Let B 2 be the set of the Boolean funetions of two variables ( and, or, xor . . .). 
A cireuit 7 over B 2 is a eonnected, aeyelic oriented graph with nodes either 
having in-degree 2 and labeled by a function of B 2 (these nodes are ealled gates^, 
or having in-degree 0 and labeled by variables (these nodes are ealled leavesj. 
Only one node has out-degree 0 and is ealled output. A node eomputes a value 
by applying its labeling function to the values computed by its preceeding nodes. 
Thus a circuit 7 with n distinct variables labeling its leaves computes a Boolean 
function f of n variables in its output. 7 is called a syntax of f , whereas f is 
called the semantic of^. 

Definition 2. The Shannon eomplexity of a Boolean function is equal to the 
number of gates of its most compact syntax. 

Here, due to the parallel expression of the MB algorithm, the Shannon mea- 
sure represents the minimal execution time on a basic Cellular Automata ma- 
chine, just as the one developed in our team, under the form of a Programmable 
Artificial Retina [13]. However, the Shannon measure makes sense for any soft- 
ware or hardware implementation, as it is related to Area-Time complexity. It 
also represents the minimal amount of energy required to perform the computa- 
tion of a given Boolean function, as every operation between two bits of infor- 
mation physically consumes some energy quantum, whatever the computation 
medium. 



4 Computing digital skeletons 

We now move to the algorithmic principles. Here, the MB algorithm is presented 
following a constructive approach. Using fully isotropic procedures, it is shown 
first how to keep the medial axis in the skeleton, then how to preserve homo- 
topy, even when pixels are removed in parallel. For each of these issues, some 
basic definitions or results are recalled before a precise explanation of the MB 
algorithm operation is provided. 



4.1 Distance map - Local maxima 

In the discrete plane represented by the square mesh, two topologies can be 
defined, depending on the neighborhood relation: either the 4-connectivity, where 
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a pixel has one neighbor in each cardinal direction (N, E, W and S), or the 8- 
connectivity, where a pixel has 4 more neighbors (NE, NW, SW, SE). This 
induces two basic distances in 7Z^'. 6^, or diamond distance, and (5s, or square 
distance. 




Fig. 2. Experimental results. Image (a) is the original one, whereas image (b) is the 
result of the convergence of the MB algorithm, reached after 6 iterations. Images (d) 
to (h) show a detail for the 4 first iterations (the grey pixels are the removed ones) and 
image (i) represents the overall process. For comparison purposes, image (c) shows the 
result obtained by the Jang and Chin algorithm [8] on the same image. 



Definition 3. Let S be a distance in TZp' . Let = {y G S{x,y) < n} be 

the ball of centre x G TZp' and radius n € IN. Let X C . The collection of the 

centres of maximal balls is the set: 

S{X) = [j {x e X andWy G G TN.Bl C X ^ Bif ^ Blf,} 

neTN 

Definition 4. Let X C . The distance map associated with X is the func- 
tion: 

<T-.2Z^ — 

X I — > <L>{x) = S{x,X‘^) 

Property 1. 5(AT) = lj{a; G ^;Vj/ G Bf,T>{y) < T>{x)}. In other words, the 
collection of the centres of maximal balls corresponds to the set of local maxima 
of the distance map. 
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This set is related to the notion of median axis. Historically, it has been the first 
(and last !) sheer definition of the skeleton [2]. However, it is not a satisfying 
shape descriptor, since it does not preserve topology. 

Anyway, we must guarantee that the set we are building by iterated passes of 
thinning contains the median axis. A natural idea is to compute explicitely the 
set of local maxima in order to impose its presence in the result [20] , [3] . The 
simplest iterative parallel procedure that provides the local maxima for distance 
64 is shown on Table 3. Lines 1 and 2 in Table 3 compute the eroded set of 



Table 3. Local maxima procedure. 

(1) a = xn a xe 

(2) a — a A asw A x 

(3) b — un V as 

(4) b — b\/ bsw V a 

(5) X = x\b 



image X, i.e. the set A of pixels of X whose 4-neighbors are all in X. Lines 3 
and 4 compute the dilated set of A, i.e. the set B of pixels of for which at 
least one 4-neighbor is in A. Finally the set-difference of X and B is computed 
in line 5. This procedure removes every pixel x whose neighborhood matches 
the pattern shown on Figure 3.1, or one of its 7t/2 rotated configuration. The 
hachured pixels have value 1, and at least one of the three pixels denoted by y 
has value 0. The computational cost is 7. What we compute in the MB algorithm 





( 2 ) 



Fig. 3. (1) Non local maximum pattern and (2) Pattern (a). 



is slightly different, and actually a bit more restrictive. Refering to Table 1, lines 
1-3, together with line 8 (in which we temporarily forget about c), removes every 
pixel X whose neighborhood matches a pattern called (a), shown on Figure 3.2, 
or one of its 7t/2 rotated configuration. The hachured pixels have value 1, the 
white pixel has value 0. Thus it is obvious that a super-set of the local maxima 
will be produced by the MB algorithm: this makes the first step of a proof. 
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Matching pattern (a) increases the cost up to 11 operations (instead of 7), but 
as we will see, it is worth the extra. 

4.2 Homotopy 

The notion of homotopy in the real plane is originally defined between arcs, 
i.e. continuous functions from a real interval [a, b] to Two arcs are homotopic 
if there exists a continuous function mapping one to the other. This relation being 
an equivalence relation, it is said that 2 open subsets of are topologically 
equivalent if their sets of equivalence classes are isomorphic. In other words, 
two subsets of the plane are topologically equivalent (we may say homotopic) 
if there exists a one-to-one mapping between their connected components, and 
another one between the connected components of their complementary sets (the 
holes), both preserving the relation “is contained inside...”. Indeed, a connected 
component (or a hole) Y is said to be contained inside the connected component 
X if there exists a closed arc T in X that surrounds Y. This means that Y 
is completely contained in the interior of T. The notion of homotopy is indeed 
sharply based on the Jordan theorem : a simple closed arc ( or Jordan arc) divides 
the plane in two distinct regions : the interior and the exterior. As mentioned 
earlier, two kinds of topology can be used in the discrete plane. In both cases, 
however, the Jordan theorem is no longer valid, and so the notion of hole is not 
well-defined. To cope with this problem, two distinct topologies must be used 
for the image and for its complementary. So homotopy will make sense if one 
always considers the 8-connectivity for the image and the 4-connectivity for its 
complementary (or vice-versa). We may now define the notion of simple point. 

Definition 5. Simple point 

Let X C be a binary image. Let a £ X. Point a is simple iff X\{a} is 

homotopic to X. 

A well-known result is that the simplicity of a point can be determined by 
looking only at its 3x3 neighborhood. The forerunner in the search of a formula 
over the 3x3 neighborhood to characterize simplicity is Rutowitz [16]. The more 
elaborated notion of connectivity number was first defined by Yokoi [22], though 
it had been used earlier by Hilditch [7]. The 8- and 4-connectivity numbers are 
respectively defined as follows: 

Definition 6. Connectivity numbers 

3 3 

Afcg = ^(X2i+l V X2i+2) \ X2i . Mc4 = '^X2i\ {X2t+1 A X2i+2) ■ 

i—0 i— 0 

where the Xj represent the 8-neighbors of x in the following way: xq = xe, and 
so on in a counter-clockwise sense until xy = xse- Sign -|- within indices stands 
for the addition modulo 8. The N-connectivity^ number of a pixel x within its 
8-neighborhood V (x) is exactly equal to the number of N-connected components 



1 



N = 4 or 8. 
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of V (x)\{a;}, except for N-interior points (points of the image for which all their 
N-neighbors are also in the image). The N-connectivity number provides an easy 
way to characterize the N-simplicity. A point is N-simple if and only if its N- 
connectivity number equals one. 

We can compute the 8 -simplicity of pixels by using the procedure shown on 
Table 4, to be read column after column. It computes exactly the formula of 
Definition 6 and the related cost is 15 operations. The 8 -simplicity can also be 



Table 4. Hilditch procedure. 



(1) a = x\/ xe 

(2) d = On \ Xe 


(6) a = a; V xn 

(7) b = aw \ Xn 


(10) h = asE \ xs 

(11) c = cV(dA6) // c = 1 


=4> Mc% > 1 


(3) a = asw \ xw 


(8) c = c V (d A &) (12) d = d V & 


! j d^\^ 


=4> Nc% > 1 


(4) c = d A a 

(5) d = d V a 


(9) d = d V & 


(13) a — d\c 


j j 


=4> Mc% = 1 



expressed under the following pattern-matching form. A pixel x is neither an 
interior nor a simple point if its neighborhood matches one of the two patterns 
represented in Figure 4, along with their 7t/2 rotated configurations. The con- 




(P) (y) 

Fig. 4. Gap patterns. 



verse is also true, except if V (x) does not contain more than two points. By 
the way, this is the property that was used by Arcelli [1] for the computation 
of his skeleton. Here it reveals some redundancy between the computations of 
local maxima and simple points. Indeed, if a pixel x is considered as removable 
by the first part of the MB algorithm (because it matches pattern (a)), then its 
neighborhood may match pattern (/3), but never pattern ( 7 ). This fact makes it 
possible to save operations, by computing only the pattern matching of (/?), as 
performed by the end of the MB procedure (lines 4-7). 

So, knowing which pixels match pattern (a) (or a 7 t /2 rotated configuration) 
reduces to 6 (instead of 15 !) the number of operations required for the compu- 
tation of simplicity. This a posteriori justifies the enigmatic choice of (a) in the 
previous section. Anyway, we have now shown that the points removed by the 
MB algorithm are neither local maxima nor simple points. This is still not suffi- 
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dent as a proof, since we must now consider the last but not least characteristic 
of the MB algorithm: the parallel removal. 

4.3 Parallel computing of the skeleton 

As a matter of fact, simplicity according to Definition 5 is an individual charac- 
teristic. In general, the parallel removal of simple points leads to disconnection, 
with the disappearance of the two-pixel-thick lines. To cope with this major 
difficulty, three different approaches have been devised: 

• Directional sub-iterations: Rosenfeld [9] proved that simple points could be 
removed in parallel without changing topology if one allows the pixels to be re- 
moved in one direction only (e.g. east border simple points). In this kind of algo- 
rithms, that we now denote DS-algorithms, the thinning process peels the image 
removing border pixels in one direction after the other. The most efficient of these 
algorithms operate in two passes (e.g. N and W, followed by S and E) [19]. The 
characterization of the skeletal points is in general simplified [8], but the number 
of iterations multiplied by two. Note that, in such a case the skeleton is by nature 
non isotropic, since the result depends on the order in the processing directions. 

• Semi-parallel sub-iterations: by degenerating the parallelism according, for 
example, to the respective parity of the lines and columns, one may clusterize 
the pixels, and compute a parallel iteration of the skeleton by processing as much 
passes as there are pixels in the cluster [12]. The result of these algorithms (SP- 
algorithms) is also sensitive to the choice of the order in the sub-iterations, and 
furthermore, it produces zigzagging artifacts on straight lines, as pointed out in 
[ 10 ]. 

• Fully parallel: removing non-skeletal pixels in all directions without chang- 
ing the topology is possible provided that larger neighborhoods are examined 
[6] . As a consequence, if the number of iterations is reduced in comparison to 
multi-subiterations algorithms, the computational cost of one iteration is always 
larger, to such a point that their speed may be smaller in some cases. See for 
example, [18], [21], and [5]. A general scheme to prove the soundness of FP- 
algorithms is provided by Ronse [15], who extended the notion of simplicity to 
sets of points, and proved that the parallel notion of set-simplicity is equiva- 
lent to the existence of an order according to which all points in the set can be 
removed sequentially, every point being simple when it is removed [14]. 

Now the good news is that the MB algorithm proves to belong to the FP- 
class: points that match pattern (a) but not ((3) actually form a simple set! In 
[15], Ronse gave minimal conditions for checking connectivity preservation of 
parallel thinning algorithms. Theorem 3.5 together with Proposition 3.6 of this 
paper leads to the following property: A parallel thinning algorithm preserves 
8-connectivity if: (1) The pixels removed are simple, (2) A connected component 
contained in a 2x2 square cannot be completely removed, and (3) Two 4-con- 
nected pixels removed at the same time would have been removed as well if 
treated sequentially. 

Condition (1) has been proven in Section 4.2. Condition (2) is obviously met, 
as a pixel is removed only if it is 4-connected to an interior pixel. To see that 
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Condition (3) is also satisfied, let us notice that two 4-adjacent pixels are si- 
multaneously removed only if they match the pattern showed on Figure 5, in 
which none of the two pixels belongs to the pattern (a) of the other. This proves 




Fig. 5. Case of two pixels 4-connected. 



that the MB algorithm is an FP one. As such, its number of iterations before 
convergence equals the maximum of the distance map (cf. Definition 4), i.e. 
half the thickness of the biggest object in the image. Finally, removing simple 
sets, but as big as possible, while preserving the median axis, through such a 
compact isotropic procedure is the remarkable achievement of the MB algorithm. 
Its superiority can be appreciated on Table 5 which shows a comparison with 
several (fully or not) parallel algorithms: MB is nearly twice as fast as the fastest 
known algorithm ! 



Table 5. Comparing the cost of parallel thinning algorithms, d stands for the maximal 
object thickness (i.e. the diameter of the biggest ball contained in the image). The 
number of elementary operations given for the other algorithms is sometimes only an 
estimation, as the quoted papers did not always provide a logic minimization of their 
algorithm. Note that [5] was ranked second among ten algorithms in [11], and the best 
FP one. 



Algorithm 


Type 


Size of neighborhood 
examined 


Number of elementary 
operations required 


Jang & Chin [8] 


DS 


7 


16 X d 


Cardoner & Thomas [3] 


DS 


7 


20 X d 


Stewart [18] 


FP 


19 


30 X d 


Wu & Tsai [21] 


FP 


11 


30 X d 


Guo & Hall [5] 


FP 


11 


40 X d 


Manzanera & Bernard (MB) 


FP 


13 


9 X d 



4.4 Isotropy and One-pixel Thickness 

The MB algorithm computes the skeleton by logical operations that are based 
on the matching of patterns (a) and (/3), with all their tt/2 rotated configura- 
tions. Then these operators are completely symmetrical. As the algorithm is fully 
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parallel (no spatial subiteration), this guarantees isotropy. Then the presence of 
two-pixel-thick lines in the skeleton is expected. If required, one-pixel thickness 
can be obtained by using two directional passes at a very low extra cost. For 
example, if we compute a North, then a West point deletion with the Jang & 
Chin algorithm, the post-processing has a cost of 16 operations (whereas an MB 
iteration costs 18). 



5 Conclusion 

We have presented a new parallel iterative thinning algorithm which computes 
the skeleton of an image in what we believe to be the purest form. This skeleton 
must (a) contain the median axis for the diamond distance, (b) preserve topology 
and (c) be obtained through a purely isotropic procedure. 

To remove pixels from the original image, the MB algorithm fulfils the three 
former requirements by checking two pixel conditions that we may call “definitely 
not a local maximum” (i.e. matching a) and “semi-simple” (i.e. not matching 
j3) . While extremely simple to compute, the combination of these two conditions 
allows a most efficient fully parallel (or one-pass) operation of the MB algorithm. 
With the help of systematic logic minimization, this results into a dramatically 
low computational cost in terms of logical operations needed for the complete 
execution of the algorithm. 

In spite of its (much) faster computation, the MB skeleton is comparable in 
quality to those produced by other algorithms, as shown by the experimental 
results. While its preferred implementation is on SIMD machines that compute 
binary logic, the speed advantage of the MB algorithm should remain on most 
computers, parallel or not, due to the fundamental character of the Shannon 
measure. 
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Abstract. Thinning of a binary object is an iterative layer by layer 
erosion to extract an approximation to its skeleton. In order to pro- 
vide topology preservation, different thinning techniques have been pro- 
posed. One of them is the directional (or border seqnential) approach 
in which each iteration step is subdivided into subiterations where only 
border points of certain kind are deleted in each subiteration. There 
are six kinds of border points in 3D images, therefore, 6-subiteration 
parallel thinning algorithms were generally proposed. In this paper, we 
present two 8-subiteration algorithms for extracting “surface skeletons” 
and “curve skeletons” , respectively. Both algorithms work in cubic grid 
for (26,6) images. Deletable points are given by templates that makes 
easy implementation possible. 



1 Introduction 

Thinning is a common pre-processing operation in pattern recognition. Its goal 
is to reduce binary objects to their “skeletons” in a topology-preserving way [4]. 
Border points of the binary object that satisfy certain topological and geometric 
constraints are deleted in iteration steps. Deletion means that I’s (object ele- 
ments) are changed to O’s (background elements). The entire process is repeated 
until only the “skeleton” is left. 

A thinning algorithm does not preserve topology if 

— any object in the input picture is split (into two or more ones) or completely 
deleted, 

— any cavity in the input picture is merged with the background or another 
cavity, or 

— a cavity is created where there was none in the input picture. 

There is an additional concept called hole in 3D pictures. A hole (that dough- 
nuts have) is formed by O’s, but it is not a cavity [4,5]. Topology preservation 
implies that eliminating or creating any hole is not allowed. 

A simple point is an object point whose deletion does not alter the topology 
of the picture [10]. Sequential thinning algorithms delete simple points which 
are not end points, since preserving end-points provides important information 
relative to the shape of the objects. Curve thinning preserves line-end points 
while surface thinning algorithms do not delete surface-end points. 

Parallel thinning algorithms delete a set of simple points. A possible approach 
to preserve topology is to use directional strategy; each iteration step is composed 
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of a number of parallel subiterations where only border points of certain kind 
can be deleted in each subiteration. There are six kinds of border points in 3D 
images on cubic grid, therefore, 6-subiteration parallel thinning algorithms were 
generally proposed [2,3,6,11,13,15] (with the exception of [12]). 

In this paper, 8-subiteration directional algorithms are proposed for curve 
thinning and surface thinning. Our approach demonstrates a possible way for 
constructing non-conventional directional thinning algorithms. 



2 Basic notions and results 

Let p be a point in the 3D digital space 2Z^ . Let us denote iVj(p) (for j = 
6, 18,26) the set of points j-adjacent to point p (see Fig. 1 (a)). The sequence 
of distinct points {xq,xi, . . . ,x„) is a j-path of length n > 0 from point xq to 
point x„ in a non-empty set of points X if each point of the sequence is in X 
and Xi is j-adjacent to for each 1 < z < n. (Note that a single point is a 
j-path of length 0.) Two points are j-connected in the set X if there is a j-path 
in X between them. A set of points X is j-connected in the set of points Y D X 
if any two points in X are j-connected in Y. 
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Fig. 1. The frequently used adjacencies in (a). The set Ne{p) contains the central 
point p and points marked U, N, E, S, W, and D. The set of points Nig(p) contains 
the set Ne{p) and points marked The set of points N^eijp) contains the set N\s(p) 
and points marked “o”. 

The usual 6 directions (b) corresponding to N^{p) and the proposed 8 deletion direc- 
tions (c) corresponding to N 26 {p)\Nis{p). 



The 3D binary (m,n) digital picture V is a, quadruple V = (7Z^ ^ m, n, B) [4]. 
Each element of 2Z^ is called a point of V. Each point in S C 2Z^ is called a 
black point and value 1 is assigned to it. Each point in is called a white 

point and value 0 is assigned to it. Adjacency m belongs to the black points 
and adjacency n belongs to the white points. A black component (or object) is 
a maximal m-connected set of points in B. A white component is a maximal 
n-connected set of points in B C 2Z^ . 

We are dealing with (26,6) pictures. It is assumed that any picture contains 
finitely many black points. 

A black point is said to be border point if it is 6-adjacent to at least one white 
points. (Note that this definition is correct only for the special cases m = 26 and 
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m = 18.) A border point p is called \3-horder point if the point marked by U 
in Fig. 1 (a) is white. We can define N-, E-, S-, W-, and D-border points in 
the same way. The set Nq{p) are subdivided into three kinds of opposite pair of 
points (U,D), (N,S), and (E,W). 

A black point is called simple point if its deletion does not alter the topology 
of the picture. We make use the following result for (26,6) pictures: 

Theorem 1. Black point p is simple in picture 26, 6, B) if and only if all 
of the following two conditions hold [9,14].- 

1. The set {B\{p}) n A^ 2 e(p) contains exactly one 26-component. 

2. The set {2Z^\B) n Nq{p) is not empty and it is 6-connected in the set 

We need to consider what is meant topology preservation when a number of 
black points are deleted simultaneously. Ma [7] and Kong [5] gave sufficient condi- 
tions for parallel reduction operations of 3D (26,6) pictures. Those require some 
additional concepts to be defined. Let 7^ be a picture. The set D = {di, . . . ,dk\ 
of black points is called a simple set of V if D can be arranged in a sequence 
(djj , . . . ,dif.) in which each di^ is simple after {dij, . . . , dq.i} is deleted from 
V, for j = 1, ... ,k. (By definition, let the empty set be simple.) A unit lattice 
square is the set of four corners of a unit square embedded in a unit lattice 
cube is the set of eight corners of a unit cube embedded in . 

Theorem 2. A 3D parallel reduction operation is topology preserving for (26,6) 
pictures if all of the following conditions hold [5,7].' 

1. Only simple points are deleted. 

2. If two black corners, p and q, of a unit lattice square are deleted, then the 
set {p, q} is simple. 

3. If three black corners, p, q, and r, of a unit lattice square are deleted, then 
the set {p, q, r} is simple. 

4-. If four black corners, p,q,r, and s, of a unit lattice square are deleted, then 
the set {p, q, r, s} is simple. 

5. No black component contained in a unit lattice cube can be deleted completely. 

3 The new algorithms 

Each 6-subiteration directional thinning algorithm uses the six deletion direc- 
tions that can delete certain U-, D-, N-, E~, S-, and W-border points, re- 
spectively [2,3,6,11,13,15]. In our 8-subiteration approach, three kinds of border 
points can be deleted in each subiteration. The 8 directions are denoted by 
USW, UWN, UNE, UES, DSW, DWN, DNE, and DES. Note that each 
triple corresponds to the three points that are 6-adjacent to the point p and 
18-adjacent to each others (see Fig. 1 (a)). The usual 6 and the proposed 8 
deletion directions are illustrated in Fig. 1 (b) and Fig. 1 (c), respectively. 

Suppose that the 3D (26,6) picture to be thinned contains finitely many 
back points. Therefore, it can be stored in a finite 3D binary array. (Outside 
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of this array every point is white.) Reduction operations associated with the 
8 subiterations are called deletion_from_USW, ... , deletion_f rom_DES. We are 
now ready to present the 8-subiteration approach formally. 

Input: binary array X that represents the picture to be thinned 
Output: binary array Y that represents the thinned picture 

8-subiteration_thinning(X ,y) 
begin 

r = X; 

repeat 

Y = deletionjfrom_USW(y); 

Y = deletionjfrom_DNE(y); 

Y = deletion_from_USE(y); 

Y = deletion_from_DNW(y); 

Y = deletion_from_UNE(y); 

Y = deletion jfrom_DSW(y); 

Y = deletion jfrom_UNW(y); 

Y = deletion_from_DSE(y); 
until no points are deleted] 

end. 

Note that choosing another order of the deletion directions yields another 
algorithm. The proposed order shows a kind of symmetry, therefore, the thinned 
objects are in their geometrically correct positions (i.e., the “middle” of the 
original objects). 

Now, the successive parallel reduction operations are to be given. We pro- 
pose a curve thinning algorithm and a surface thinning algorithm. The deletable 
points (that are simple points and not end points) are given by sets of 3 x 3 x 3 
matching templates. A black point is deletable if at least one template in the set 
of templates matches it. 

We now give the characterizations of the curve-end points and the surface- 
end points. 

Defiuitiou 3. A black point p is a curve-end point in a picture , 26, 6, B) if 
the set (A^ 2 e(p) H B)\{p} is singleton (i.e., p is 26-adjacent to exactly one black 
point). 

Defiuitiou 4. A black point p is a surface-end point in a picture 26, 6, B) 
if the set Nq{p)\B contains at least one opposite pair of points (see Fig. 2). 
(Note that each curve-end point is a surface-end point.) 

Note that different surface-end point characterizations have been proposed 
by various authors [1,2,3,6,8,15]. 

A set of seven base templates is given by Fig. 3. The set of templates 

is assigned to the first subiteration of our curve thinning algorithm. It can 
be get from the base templates by the following rules: 

1. if template t G then t G 
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Fig. 2. The three configurations assigned to surface-end points. Every position marked 
is a black point, every position marked “o” is a white point, and every (“don’t 
care”) may be either a black or a white point. 

2. if template ti G T^sw template t 2 is the reflected version of t\ with 
respect to the three symmetry planes illustrated in Fig. 4, then t 2 G T^sw 




Fig. 3. The set of 7 base templates assigned to the first subiteration of curve 

thinning. Notations: every position marked matches a black point, every position 
marked “o” matches a white point, every (“don’t care”) matches either a black or 
a white point, at least one position marked “x” matches a black point. 

It is easy to see that T^sw contains 22 templates for deleting certain U-, S~, 
or W-border points that are not curve-end points. Note that template positions 
marked “x” are used for preserving curve-end points. These templates define the 
parallel reduction operation deletion_from_USW. The deletion rules assigned to 
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the other seven subiterations can be derived from the appropriate rotations and 
reflections of the templates in 




Fig. 4. The three symmetry planes for reflecting templates. Points belonging to the 
given planes are marked 



The set of four base templates given by Fig. 5 belongs to the first 

subiteration of our surface thinning algorithm. The whole set of templates T^sw 
can be derived in the same way as it was in the case of the curve thinning 
algorithm (with the help of the 3 symmetry planes). It is easy to see that 
contains 15 templates for deleting certain U-, S~, or W-border points that 
are not surface-end points. Note that template positions marked “x” and “y” 
are used for preserving surface-end points. The sets of templates assigned to 
the other seven subiterations can be derived from the appropriate rotations and 
reflections of the templates in %fgw Mention is to be taken that the set of 
templates T-^sw is derived from Tf^g^ by changing the end point criterion. 

Both sets of templates and T(fsw) were constructed for deleting some 

simple points which are neither end points nor extremities of curves/surfaces. 
Note that the same template is included in more sets of templates: for instance, 
template B3 in is in sets of templates assigned to deletion directions 

UWN, UNE, and UES, too. 

4 Discussion 

Different shapes of objects have been tested by the new algorithms. Here we 
present only three examples (see Figs. 6-7). 

The behavior of the proposed algorithms has been also investigated. Here 
we present an example in Fig. 8 for thinning a “noise-free” solid doughnut of 
size 40 X 40 X 20 and its noisy version. Noise was added to the boundary (i.e., 
some border points are deleted from the original noise-free object and some 
white points 26-adjacent to a border point are changed to black). Note that we 
consider a model of noise which consists of deleting and adding simple points. 
We can state that the proposed curve thinning algorithm is robust under noise. 
Unfortunately, the 8-subiteration surface point algorithm is rather sensitive to 
boundary noise, since a noisy boundary may contain a number of surface-end 
points to be preserved. 
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Fig. 5. The set of base templates Bhsw is derived from Z?usw It is assigned to the 
first subiteration of surface thinning. Notations: every position marked matches a 
black point, every position marked “o” matches a white point, every (“don’t care”) 
matches either a black or a white point; at least one position marked “x” matches a 
black point, at least one position marked “y” matches a black point. 



The proposed algorithms go around the object to be thinned according to 
the 8 deletion directions. One can say that the 8-subiteration approach seems 
to be slower than the 6-subiteration one. It is not necessarily true. Suppose that 
our sample object is a cube of size 16 x 16 x 16. An iteration step of any 6- 
subiteration algorithm can generate a 14 x 14 x 14 cube from the original one. 
A cube of size 8 x 8 x 8 is created by the 8-subiteration algorithms. According 
to our experiments, less iteration steps are required by the proposed algorithms. 

The proposed 8-subiteration algorithms are topology preserving for (26, 6) 
images. It is sufficient to prove that reduction operation given by the set of 
templates is topology preserving. If the first subiteration of the curve is 

topology preserving, then the other seven ones are topology preserving, too, since 
the applied rotations and reflections of the deletion templates do not alter the 
topological properties. Therefore, the entire curve thinning algorithm is topology 
preserving, since it is composed of topology preserving reductions. The surface 
thinning algorithm is topology preserving, too, since its subiterations can delete 
less kinds of border points. 

Instead of using Theorem 2, we propose the following more general theorem 
that provides new sufficient conditions for 3D parallel reduction operations to 
preserve topology. 

Theorem 5. Let T he a parallel reduction operation on (26, 6) pictures. Then 
T is topology preserving, if for all picture V = {2Z^ , 26, 6, B), all of the following 
conditions hold: 
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Fig. 6. Thinning of two synthetic objects (left); the results of surface thinning (centre); 
the results of curve thinning (right). (Cubes represents black points.) 



1. for all points p G B that are deleted hyT and for all sets Q C (A^ig(p)\{p})n 
B that are deleted by T, p is simple in the picture ,26,6, B\Q); and 

2. no black component contained in a unit lattice cube can be deleted completely 
by T. 

Proof, (sketch) We show that condition 1 of Theorem 3 implies conditions 1-4 
of Theorem 2. 

It is obvious that the set Nis{p) contains any unit lattice squares in which p 
is a corner. Point p is to be regarded as the last element of the simple sequence 
of corners while the preceding ones are in set Q. If Q = 0, then we get Condition 
1 of Theorem 2. 

Condition 2 of Theorem 3 corresponds to condition 5 of Theorem 2. □ 

In order to prove both conditions of Theorem 3, we classify the elements of 
templates and state some properties of the set of templates Tfisw 

The element in the very centre of a template is called central. A non-central 
template element is called black if it is marked A non-central template 
element is called white if it is marked “o”. Other non-central template element 
which is not white and not black, is called potentially black. A black or a poten- 
tially black non-central template element is called non-white. 

Let us state some properties of the set of templates 
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Fig. 7. Thinning of a ventricle extracted from a (greyscale) 3D MR brain study (left); 
the result of surface thinning (centre); the result of curve thinning (right). 



Observation 6. Let us examine the configuration of 3 x 3 x 3 points illustrated 
in Fig. 9. If point q is black or each point marked r is white, then black point p 
cannot be deleted by 

Observation 7. Let us examine the configurations of 3 x 3 x 3 points illustrated 
in Fig. 10. If both points p and q are black and q can be deleted by then 

point r is black and each point marked s is white. 

Observation 8. Let us examine the configurations of 3 x 3 x 3 points illustrated 
in Fig. 11. If both points p and q are black and q can be deleted by then 

one of the following cases holds: 

— point ri is black and both points marked Si are white and both points 
marked S 2 are white and both points marked S 3 are white; 

— point T 2 is black and both points marked si are white and both points 
marked S 4 are white and both points marked S 5 are white; 

— point rs is black and each point marked Si {i = 1, . . . , 5) is white; 

— both points ri and T 2 are black and both points marked si are white and 
both points marked S 2 are white; 

— both points r\ and T 2 are black and both points marked si are white and 
both points marked S 4 are white. 

Observation 9. Let us examine the configuration of 3 x 3 x 3 points illustrated 
in Fig. 12. If black point p can be deleted by then at least one point 

marked q is black. 

Theorem 10. Reduction operation given by the set of templates topol- 

ogy preserving. 

Proof, (sketch) It is easy to see that each template in deletes only simple 

points of (26,6) pictures. 
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Fig. 8. Noise sensitiveness of the proposed algorithms. Thinning of a “noise-free” solid 
doughnut and its noisy version (left); the results of surface thinning (centre); the results 
of curve thinning (right). 
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Fig. 9. Configuration assigned to Observation 1. 



The first statement is to verify that there exists a 26-path between any two 
non-white positions (condition 1 of Theorem 1). It is sufficient to show that 
any potentially black position is 26-adjacent to a black position and any black 
position is 26-adjacent to another black position (if the template contains at 
least two black positions). It is obvious by careful examination of the templates 
in 

■‘usw 

Let us examine condition 2 of Theorem 1. To prove it, it is sufficient to show 
for each template in 
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Fig. 10. Configurations assigned to Observation 2. 
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Fig. 11. Configurations assigned to Observation 3. 



1. that there exists a white position 6-adjacent to the central position, 

2. that for any two white positions 6-adjacent to the central position p are 
6-connected in the set of white positions 18-adjacent to p, 

3. and that for any potentially black position 6-adjacent to the central position 
p, there exists a 6-adjacent white 18-neighbour which is 6-adjacent to a 
white position 6-adjacent to p. 

The three points are obvious by a careful examination of the templates in 

q-C 

•‘usw 

It can be stated that the value of any point coincides with a potentially black 
template position does not alter the simplicity of p. Therefore, it is sufficient to 
deal with sets Q containing points that can be deleted by TJsw coincide 
with black template positions. It is easy to see with the help of Observations 1-4 
that deletion of such sets Q does not alter the simplicity of point p. Therefore, 
Condition 1 of Theorem 3 is satisfied. 

Condition 2 of Theorem 3 can be seen with the help of Observation 1. □ 
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Abstract. This article introduces an order independent homotopic thin- 
ning for binary pictures. This thinning has the following properties: it 
does not use homotopic structuring elements, it is independent of the 
order in which pixels are processed, it is invariant through 7t/2 rotations, 
and it takes into account global characteristics of the image through 
Rouse’s criterion of strong 8-deletability. An algorithm implementing this 
new thinning is presented and applied to the skeletonisation of binary 
patterns. 



1 Introduction 

Image homotopic thinning is a usual pre-processing transformation in pattern 
recognition. Its aim is to provide a compact representation of the image while 
preserving its topological properties. The present paper introduces a homotopic 
thinning algorithm independent of the order in which pixels are processed. 

This paper is organised as follows. Basic definitions and notations are pro- 
vided in Section 2. Section 3 introduces the notion of independent simple pixels 
and proposes a characterisation of such pixels. Section 4 concerns the implemen- 
tation of the resulting thinning algorithm and the study of its properties. Before 
concluding, section 5 shows that order independent thinnings can be used for 
skeletonising binary patterns. 



2 Definitions and notations 

In this section we consider discrete binary images in the square grid. Some basic 
concepts like discrete image, discrete distance, and discrete neighbourhood are 
assumed to be known by the reader. Rigourous definitions of these concepts 
as well as a good introduction to digital topology can be found in [KR89] and 
[CM91]. In the following definitions and notations, X equals either 4 or 8. 

’’ This work has been initiated while the authors were with the LGI2P, Ecole des Mines 
d’Ales & EERIE (Nimes). 



G. Bertrand, M. Couprie, L. Perroton (Eds.): DGCI’99, LNCS 1568, pp. 337-346, 1999. 
(c) Springer- Verlag Berlin Heidelberg 1999 
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2.1 Neighbourhood and connectivity 

We denote by dx the ^-connected distance, by Nx{P) the set of X-connected 
neighbours of a pixel P, and by Nx{P) the set Nx{P) U P. The set of X- 
connected neighbour pixels of P with the value b {b G {0, 1}) is denoted by 
N^{P) and is defined as follows: N^{P) = \Ji{Pi G Nx{P) \ Im{Pi) = b}. 
Similarly, Nx(P) = N^(P) U P. 

These definitions are easily extended to an arbitrary set S of pixels: Nx{S) = 
Up^sNx(P), Nx{S) = Nx{S) \ S, n],{S) = Upes^x(^). and iV^(^) = 
Nx(S)\S. 

We call X-path a sequence of pixels such that each pixel of the sequence is 
a X-connected neighbour of the next one. The first and the last pixels of this 
sequence are called the extremities of the path. A X^-path is a A-path whose 
pixels have all the value b. 

In an image Im, two pixels Pq and Pn are X^ -connected iff there exists 
a A^-path between Pq and Pn- The relation to be X^ -connected is an equiv- 
alence relation. The corresponding equivalence classes define the A^-connected 
components of pixels. They are denoted by CCx- Finally, the A-connected com- 
ponent containing P will be denoted by CCx{P) and consequently CCx{P) = 
CC^{P)\P. 

2.2 Homotopy 

We will now place our study within the common practical framework [Ros70] 
by using the 4-connectivity for the background (pixels with the value 0) and 
the 8-connectivity for the foreground (pixels with the value 1). We also assume 
that the pixels outside of the image definition domain are background pixels. 
By doing so, we make sure that there exists a CC^, noted S outside, which is the 
supremum of the embedding relation C detailed hereafter. 

Let Si be a CC^, So a CCl and S' either a CC^ or a CCl Then, Si C S 
iff every 4-path linking a pixel of Si to a pixel of Soutside contains at least one 
pixel of S. Similarly, So C S iff every 4-path linking a pixel of So to a pixel of 
Soutside contain at least one pixel of S. 

The adjacency tree [Bun69] or homotopy tree [Ser82] of a binary image is a 
graph whose vertices represent the background and foreground connected com- 
ponents, whose root is Soutside &nd where the ascendents of a vertex V are the 
vertices corresponding to the connected components embedding V (i.e., two ver- 
tices are joined by an edge if and only if the corresponding embedded connected 
components are adjacent to each other [KR89]). Two 2-D binary images Imi 
and Im 2 are homotopic if and only if they have the same homotopy tree [Ser82]. 

Finally, a pixel is called deletable [Ros70] or simple [Kon95] if and only if 
the image obtained by removing this pixel (i.e. changing its value from 1 to 0) 
is homotopic to the original image. We call simpleness property the property of 
being simple. Depending on whether a pixel is simple or not, the simpleness test 
will return true or false. Rosenfeld’s characterisation of a simple pixel [Ros70] 
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provides us with a practical definition for implementing the simpleness test. 
Indeed, he has shown that a pixel P of a 2-D binary image Im is simple if and 
only if the two following conditions hold: 

1. The set of 8-connected foreground neighbours of P, Nl(P), is nonempty and 
8-connected; 

2. The set of 4-connected background neighbours of P, N2{P), is nonempty. 

3 Characterisation of order independent simple pixels 

3.1 The notion of independence 

A trivial order dependent homotopic thinning algorithm applied to a binary 
image Im is 



for each pixel P of I m 
if P is simple in I m 

add P to the set Simpleset of simple pixels 
for each pixel P G Simpleset (arbitrary sequential scanning order) 
if P is simple in I m 
Im{P) ^ 0 



This algorithm will be referred to as Algol . At the end of Algol, the input 
image Im contains a thinned image which is homotopic to the original image. 
Unfortunately, the resulting image depends on the scanning order of the set of 
simple pixels. An order-independent independent homotopic thinning algorithm 
can be obtained by setting to 0 those pixels of the original image which are set to 
0 by Algol whatever the considered scanning order (i.e., for all possible scanning 
orders). We call these pixels the order independent simple pixels and they are 
said to be independent. 



3.2 Independence with regard to a single pixel 

We will first consider the case where a simple pixel P has only one simple pixel Q 
in its neighbourhood. In this particular case we will say that P is (in)dependent 
of Q in the sense that the removal of P is (in)dependent of the order in which the 
simple pixels are computed. In such a case the only two possible configurations 
of the neighbourhood of P are: (i) the one with Q = 1, obtained by all scanning 
orders, where P is processed first, and (ii) the one with (5 = 0 obtained by all 
scanning orders where Q is the first pixel processed. 

Lemma 1. A simple pixel P is dependent of a simple pixel Q if they have one 
of the following configurations (plus their rotations through 7r/2j.- 
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* = 0 or 1. 



Proof. We have to show that the result of the simpleness test of P will be false 
if Q has the value 0. 

— For the first two configurations, P is simple, this implies that Ng{P) is 8- 
connected. Therefore, iVg (P) contains a single pixel Q and if Q is set to 0, 
-/Vg (P) will be empty. Consequently, P will no more satisfy the first Rosenfeld 
test. 

— For the four other configurations, P is simple, this implies that N2{P) is 
nonempty and therefore if Q is set to 0, iVg (P) will no longer be 8-connected. 
Again it follows that P will no more satisfy the first Rosenfeld test. 

□ 



Theorem 1. A simple pixel P is independent of a simple pixel Q iff they have 
one of the following configurations (plus their rotations through (2): 





Proof. 

— From right to left: 

• First Rosenfeld test: in these configurations, P has an 8-neighbour with 
the value 1 different of Q and Q plays no role in the connectivity of 

Ni(P). 

• Second Rosenfeld test: P is simple so N2{P) is nonempty and this will 
not change by setting Q to 0. 

This demonstrates that the simpleness test of P is independent of the value 
of Q: it returns true whatever the value of Q. 

— From left to right: if P and Q are in an other configuration they are in one 
of the configuration of lemma 1 and therefore P is dependent of Q. 

□ 
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3.3 Strict independence 



Let us now consider the general case where there are n simple pixels Qi, . . . , Qn 
in the neighbourhood of P. We denote by Qi any one of these simple pixels. 

If P is dependent of Qi, P is not removed for sequences starting by Qi. This 
implies that for being independent, P must be independent of all its simple 
neighbours. 

By assuming that the ‘1’ pixels shown in the configurations of theorem 2 are 
all nonsimple (i.e., they do not match any Qi), we define a strict independence 
between P and its simple neighbours in the sense that the simpleness test of P 
is independent of the value of its simple neighbours. This guarantees that, if the 
‘1’ pixels in the configurations shown in theorem 2 are nonsimple pixels, P will 
always be removed by Algol. 

Now, the only configurations for which we have not yet a result are those 
where P is independent of all the Qi but there exists at least one of the Qi 
noted Qs such that P is independent of Qs but not strictly. In such a case, the 
simpleness test of P depends on the value of its simple neighbours. Nevertheless, 
since these pixels are themselves interdependent, all combinations are not pos- 
sible. This restricts the neighbourhood configurations of P. All configurations 
where P is independent of Qs but not strictly are the configurations of indepen- 
dence such that the only possible matching between these configurations and 
the neighbourhood of P implies that at least a ‘1’ pixel matches with a pixel of 
{Qi, . . . , Qn} \ Qs- These configurations are the following (plus their rotations 
through 7 t/2): 




QI 


Qs 


p 


Q2 



0 


Qs 


P 


QI 



In those configurations P,Qs,Qi, and Q 2 are simple pixels. We must keep in 
mind that in Algol the first processed simple pixel has the neighbourhood found 
in the original image. Therefore, it is always simple and will be removed. In 
addition, the simple pixels processed after P have the value 1 when processing 
P. Assuming that P is independent, we find the following restrictions (we denote 
by sequences{P, R) the set of scanning sequences of Algol having P as first pixel 
and R as second): 

— configuration 1: P must be simple for sequences{Q\, P) this implies that all 
the not displayed neighbours of P must have the value 0. Otherwise they 
would not be connected with Q^. Therefore the second Rosenfeld test would 
fail and P would not be removed. 

— configuration 2: as P is simple, it has a 4-connected neighbour with the 
value 0. Repeating twice the same reasoning than in configuration 1 for 
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sequences{Qi, P) and for sequences{Q 2 , P) , we deduce that all the not dis- 
played neighbours of P have the value 0. 

— configuration 3: the fact that P is simple implies that its not displayed 
neighbours have the value 0. 

— configuration 4: sequences{Qi, P) implies that the not displayed neighbours 
of P have the value 0. 

— configuration 5: sequences{Qi, P) implies that the not displayed 8-neighbours 
of P have the value 0. Consequently, the fact that Qs and Qi are simple im- 
plies that their not displayed neighbours have also the value 0. Therefore, 
for sequences{Qs, Qi,P), Qs is removed because it is simple. This does not 
change the fact that Qi is simple and thus Qi is also removed. Hence, iVg (P) 
is empty and therefore P is not removed. 

3.4 Ronse’s theorem 



Following the restrictions detailed in the previous section, the only configurations 
that still need to be considered are (plus their rotations through tt/2): 




In these configurations it is obvious that the only cases where P is not simple 
are the cases where all the Qi have the value 0. Moreover, every 8-path linking 
P to CC^{P) \ {Qi , . . . , Qn} contains at least one Qi pixel. Therefore, to allow 
the last treated Qi to be removed, all other pixels of CCg{P) must have already 
been removed. Otherwise, the last Qi pixel is not simple and will not be removed. 
This implies that C'C'g(P) contains only simple pixels. In conclusion, the two 
conditions of the existence of a scanning order in which P is not removed by 
Algol are the following: (i) CC^{P) contains only simple pixels and (ii) it exists 
a scanning order of simple pixels such that all pixels of CC^{P) are removed 
before P. The first condition is easy to test and the second one can be answered 
by Ronse’s characterisation of 8-strongly deletable set for a nonempty set of 
8-connected pixels: 

Theorem 2. [Ron86] let Im be an image, let D he a set of 8-connected fore- 
ground pixels of Im, let t = \D\ and Im' be the image obtain from Im by 
removing all the pixels of the set D. The following statements are equivalent: 

1. D is strongly 8-deletable from Im 

2. D corresponds to Ronse’s characterisation of a 8-deletahle set 

— the set iVg(P) belongs to the same CCg of Im' 

— given D\, . . . , Dn the CCl of D, for all i in {1, , n}, the set N2(Di) 
belongs to the same CC'f of Im. 

3. we can label the pixels of D, Qi, . . . , Qt, in such a way that each Qi is simple 
in Im \ {Qj \ j < i}. 








Order Independent Homotopic Thinning 343 



4 The order independent thinning algorithm 

4.1 Characterisation of order independent simple pixels 

We will sum up the previous section to show that in all cases we have either 
found a sequence where P is not removed or it will be removed for all possible 
sequences. Indeed, given a simple pixel P: 



if P is dependent of one of its simple neighbours, 

then we have found an order where P is not removed, 
else 

if P is strictly independent of all its simple neighbours, 
then we have shown that P will always be removed, 
else 

if P and Q are not in one of the configurations shown in section 
3.4, 

then there exists an order where P is not removed, 
else 

if CCg(P) contains only simple pixels and is 8-deletable, 

then Ronse’s theorem guarantees that there is an order where 
P is not removed. 

else we have proved that P will always be removed. 



4.2 Useful properties for the implementation 

— The simpleness is a local property. Thus, when iterating order independent 
thinnings, we can restrict our research for new simple pixels to the neigh- 
bourhood of the simple pixels deleted at the previous iteration. 

— Rather than directly considering all configurations and their rotations, we 
propose to summarise them with the following rules: 

1. P is independent of Q if and only if: 

• there is another foreground pixel within the intersection of their 8- 
neighbourhood , 

• when P and Q are 4-connected, there is a 4*^-path between them 
within the intersection of their 8-neighbourhood. 

2. P is strictly independent of Q if and only if: 

• P is independent of Q, 

• and moreover there is within the intersection of their 8-neighbourhood 
a nonsimple foreground pixel. 

3. The configurations requiring to test the 8-deletability of a set are those 
where 

• P is (not strictly) independent of Q, 

• and all neighbours of P which are not neighbours of Q have the 
value 0 (VP' G Ns{P) \ Ns(Q), Im{P') = 0). 
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— An important result is provided by a Rouse’s lemma 3.1 in [Ron86]: let Im 
be an image, D a set of 8-connected foreground pixels of Im, Im' = Im \ D, 
D' C D, and D" = D \ D' . Then if D is 8-deletable from Im and D' is 
8-deletable from Im, then D" is 8-deletable in Im' . 

This guarantees that, if a pixel is not independently simple because it is 
connected to a 8-deletable set, the deletion of simple pixels cannot change 
this fact. Hence, for each pixel P, we test the 8-deletability at most once 
because either P is independently simple (and is removed) or it is not (and 
will never be). 

4.3 The algorithm 

In its basic form, the order independent thinning algorithm iterated until idem- 
potence is as follows: 



while pixels are deleted 
detect the simple pixels 

delete the independent simple pixels. 

An optimal implementation is out of the scope of this paper. Nevertheless 
the previous section already provides an implementation having a complexity 
of 0{n^) where n is the number of pixels of the input image. Indeed, we test 
each pixel only when its neighbourhood changes, i.e., at most 8 times. The 8- 
deletability of a set can be done in 0{n) and is done at most once per pixel. The 
other tests are local, so finally the complexity of the whole algorithm is 

4.4 Properties of the order independent thinning algorithm 

This thinning algorithm has the following properties: 

1. it does not use structuring homotopic elements, 

2. it is independent of the order in which the pixels are processed, 

3. it provides a result invariant through 7 t/ 2 rotations, 

4. it is naturally adapted to a parallel implementation, 

5. it has a polynomial implementation in O(n^) where n is the number of pixels 
of the input image, 

6. it takes into account global characteristics of the image through Rouse’s 
theorem. 

By definition, our algorithm can be viewed as a parallel thinning algorithm. 
Thus the notion of 8-Minimal Non Deletable sets introduced by [Ron88] could 
also be used as an alternative to prove its correctness. 



5 The order independent thinning skeleton 

A common way to define a skeleton through homotopic thinning is to perform 
sequential homotopic thinnings until idempotence [Lan78,Ser82]. If the thinning 
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Original image 
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) 



Obtained skeleton: 
the white pixels 
are the anchors points 




The skeleton merged into 
the original image 



Fig. 1. Example of skeletonisation based on order independent thinnings and ultimate 
eroded set as anchor points. 



method deletes all simple pixels, each simply connected component will be re- 
duced to a single pixel. A first possibility to avoid this situation is to delete only 
specific configurations of simple pixels. This is usually done by considering a 
family of homotopic structuring elements. Consequently, the resulting skeleton 
depends on the chosen structuring elements, the order in which they are applied, 
and the order used for processing the image pixels. 

Another approach is to first characterise a subset of pixels belonging to the 
skeleton called the anchor points [Vin91,Sch94]. The anchor points are by def- 
inition nonsimple and cannot be deleted when thinning the image. By doing 
so, it is possible to thin the image with all possible homotopic structuring el- 
ements without reducing each simply connected set to a single pixel. However, 
the result still depends on the order used for processing the image pixels. This 
problem is eluded by considering order independent thinning. Some properties of 
the obtained skeleton directly depend on the chosen set of anchor pixels. A com- 
monly used set of anchor points is composed of the centres of the maximal disks 
which guarantee that the obtained skeleton is suited for the reconstruction of 
the original image, but it is very sensitive to small boundary variations. Another 
possibility for the anchor points is to use the ultimate eroded set or, equivalently, 
the regional maxima of the distance function [Mey89]. In both cases, each CC^ 
of the original image contains at least one anchor point, and at each step of 
the thinning algorithm the current image will have the same property. Thus we 
are sure that if P is simple (and is not an anchor point), C'Cg(P) contains at 
least one anchor point (i.e., one nonsimple pixel). Thus if a pixel is in one of 
the configurations of subsection 3.4 we are sure that it is independent. This is 
an important issue because it allows us to skip the 8-delet ability test and thus 
provides us with an algorithm in 0{n). 

If the computation of the set of anchor points is independent of the pixels 
treatment and invariant through tt/2 rotations, then the resulting order indepen- 
dent thinning skeleton inherits these properties (e.g., when using the centres of 
maximal disks or ultimate eroded set). Figure 1 shows an example of skeletonisa- 
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tion using ultimate eroded as anchors points. Notice that the obtained skeleton 
is not so thin as those obtained through other thinning approaches. Indeed, a 
single algorithm cannot simultaneously be independent of the order in which 
pixels are processed and lead to one-pixel thick result (incompatible properties). 

6 Conclusion and perspectives 

We have introduced a homotopic thinning approach which does not require to 
choose a family of homotopic structuring elements. Due to its independence with 
regard to the order in which pixels are processed, it allows one to skeletonise a 
binary figure without making any arbitrary choice. Our current research aims at 
studying order independent thinnings in the case of 3-D binary images. 
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Abstract. The model of computable partial solids has been recently in- 
troduced in order to address computational geometry and solid modeling 
issues within the Turing model of computation. This approach provides 
a model that reflects well the observable properties of real solids and the 
computation on realistic computers [5] . Since a central notion of discrete 
geometry, voxel sets, can be used to define computable partial solids, this 
approach throws a bridge between discrete geometry and solid modeling 
in This paper presents this model and the recursive analysis and 
domain theory prerequisites. 



1 Introduction 

Discrete geometry is used to model “continuous geometry”. Discrete geometry 
involves finite data only and can then be handled by realistic computers, or, 
more theoretically, by Turing equivalent (for the sake of computability) com- 
puters. This situation is quite common in practical computations. Elements of 
countable sets are used to model or approximate elements of uncountable sets 
whose representation would involve infinite data. 

The availability of various data type modeling real numbers (floating point 
as well as more elaborate real numbers arithmetics) has led to design algorithms 
assuming a machine model, the “Real RAM” , able to perform exact real num- 
bers computations [12,2]. This model of computation is realistic as long as the 
computed operators are continuous, since discontinuous operators as the real 
numbers comparison predicate are not Turing computable [11]. Actually, practi- 
cal implementations of geometric algorithms which correctness have been proved 
under the assumption of the real RAM machine model often suffer of so called 
“robustness problems” . 

To address computational geometry and geometric modeling computations 
with the Turing machine model, one disposes of existing mathematical models 
of computation over uncountable sets, as recursive analysis and domain theory. 
In this context, a model for representing solids, the solid domain, has been built 
using rational polyhedra or voxels sets [9,10,5]. This model connects geometry 
of solids in R” with discrete geometry. 

In section 2, the properties of computation over uncountable sets are intro- 
duced informally in order to motivate the point of view of recursive analysis. In 
section 3, some elements of recursive analysis and domain theory used in sec- 
tion 4 are outlined. Section 4 presents the solid domain and some properties of 
computations over solids due to a common work with Pr. Abbas Edalat [6]. 
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2 Computation over uncountable sets 

Actual computers deal with finite data only. Each element of a countable set, as 
an integer or a rational number for example, can be represented by a finite word 
over a finite alphabet. Computing such an element means computing its finite 
representation. Because finite length words can represent elements of countable 
sets only, computation over elements of uncountable sets needs a more elaborate 
definition. 

Let us consider as first example the number tt. Even if it belongs to the 
uncountable set of real numbers, it is reasonable to say that there exists programs 
able to compute it. In this case, what one asks from such a program is to be able 
to compute an arbitrary accurate approximation of tt. For example, a program 
that, given any integer i as input, computes a dyadic number di such that \tt — 
di\ < is said to compute tt (a dyadic number is a rational number whose 
denominator is a power of 2) . 

Let us take as second example the computation of the solution of a PDF (for 
Partial Differential Equation) by a Ritz method. The weak form of a PDE is a 
minimization problem in an infinite dimensional real (or complex) vector space 
F of functions. Numerical methods use a sequence s = {si,i G N} such that 
finite linear combinations of elements of s are dense in F. An algorithm is said 
to compute the solution if it is able to compute, given any integer i as input, 
an up to 2“* approximation of the solution for a given metric^. The approxima- 
tions are finite linear combinations of elements of s. In actual computations, the 
coefficients of these finite linear combinations of elements of s are often floating 
point numbers. From a theoretical point of view, it is enough to assume that 
these coefficients belong to a dense subset of R as for example dyadic numbers. 

We can understand with this two simple examples two major ingredients of 
computation of an element of an uncountable set E : 

— because computing such an element means computing it up to an arbitrary 
accuracy, one see that computability over uncountable sets depends on a 
metric or at least a topology on E. 

— because approximations have to be represented by finite words, they have 
to belong to a countable set C and, to avoid a priori restriction in the set of 
approximable elements, it is natural to ask from C to be dense in E. 

The computations of approximations of subsets of R" by subsets of N", or 
voxel sets, is still another example, addressed in section 4. 

With this point of view of computation, an operator O is computable if 
there exists a program that, given any input I, is able to compute an arbitrary 
accurate approximation of 0{I) using an arbitrary accurate approximation of 
/. This informal formulation of computability shows the important proposition 
“computable implies continuous”. This point of view is formalized in the 
next section. 

^ The “right” metric depends on the nature of the PDE. For elliptic PDE, the set F 
may be a Sobolev space with its associated metric. Within finite elements methods, 
for example, the sequence s is usually made of piece-wise polynomial functions basis. 
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3 Recursive analysis and domain theory 

In this section, we give a few definitions used in subsequent sections. 



3.1 Recursion theory 

Computable functions from N to N are the partial functions that can be com- 
puted by a Turing machine^. We use the equivalent notion of partial recursive 
functions from N to N as a map from N to N that can be computed by a general 
purpose computer or a (finite length) program written in a general purpose lan- 
guage. These functions are partial since, for some integer values given as inputs, 
the program may run for ever. We say that a partial recursive function is recur- 
sive if it is total, that is if it ends within a finite number of steps of computation 
for each integer given as input. 

A subset of N is said r.e. (for recursively enumerable) if it is empty or if it 
is the range (that is the image set) of a recursive function. A subset A of N is 
r.e. if and only if there exists a partial recursive function such that /a(*) is 
defined and equal to 1 if and only if i G A. 

A subset of N is said recursive if itself and its complement are r.e.. A subset 
A of N is recursive if and only if there exists a recursive function f^ such that 
fA{i) equal 1 for t G A and 0 for t ^ A. There are r.e. sets that are not recursive, 
but their construction is not trivial. 

Because there exists computable, one-to-one maps between N and for 
each k, the definitions above (partial recursive and recursive maps, r.e. and 
recursive subsets) extends naturally to maps from to and to subsets of 
N^. These notions apply similarily to countable sets. 

3.2 Recursive analysis 

Recursive analysis [11] could be defined as the extension of recursion theory 
to computation over uncountable sets. Since Q, the set of rational numbers, is 
countable, it is in one-to-one correspondence with N and we can write Q = 
{< 7 „|n G N} where = Qm if and only if n = m. A real number x is said 
computable if and only if there exists a recursive function / such that Vn G 
N, \x — qf{n)\ < 2 ”(^). Since the set of recursive functions, as the set of finite 
programs, is countable, there are non computable real numbers. 

In a metric space, a sequence (x„)„gN is said to converge effectively toward 
X if and only if there exists a recursive function / such that 

\/n,m G N, TO > /(n) d{x,Xm) < 2“" 

Note that, in the definition of a computable real number, one do not ask only 
for the existence of a computable sequence of rational numbers converging to x: 

^ See the Godel-Kleene theorem [1]. 

® This definition has been given by Turing in 1937. 
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the convergence have to be effective. A real number is said lower (respectively 
upper) semi- computable if and only if it is the limit of an increasing (respec- 
tively decreasing) computable sequence of rational numbers. A real number is 
computable if and only if it is lower semi-computable an upper semi-computable. 

A function / from [0, 1] to R is said computable if it is the effective limit, for 
the sup norm, of a computable sequence of polynomials with rational coefficients. 
It can be shown that an equivalent definition is that there exists a program able, 
for any real x in [0, 1] and any integer i given as input, to compute : 

— an integer j, 

— an up to approximation of f{x) using an up to 2~^ approximation of x. 

It follows that such a function is continuous. 

A general theory of computability over uncountable sets has been developped 
[16], based on a modified Turing machine, the TTE (for Type Two theory of 
Effectivity), whose inputs and output are infinite sequences of symbols. The solid 
domain and other notions introduced here could be defined in this framework 
(see [3]). However, we use hereafter a domain-theoretic framework because, even 
if it is, in some sense, equivalent, domain theoretic approach highlights the links 
between computability and general topology. It allows to build up most of the 
mathematical structures in the framework of partial orders and general topology, 
introducing lately the “Turing computability ingredient” through the notions of 
r.e. subsets of N and recursive functions. 

3.3 Domain theory 

Domain theory was originally introduced independently by Scott [15] as a math- 
ematical theory of semantics of programming languages and by Ershov [4] for 
studying partial computable functionals of finite type. 

A domain is a mathematical structure for data types or operators represent- 
ing incomplete or uncertain information. It is a partially ordered set (D, C) where 
the partial order corresponds to some notion of information. “A C B” means 
“the information represented by A is contained in the information represented 
by B”. 

A simple example is the domain {tt, ff, T} of the Boolean values tt and ff to- 
gether with a least element T below both. One thinks of T here as the undefined 
Boolean value that represents “no information at all” . 

Another example is the interval domain I[0, 1]. It is the set of all non empty 
real intervals [a,b], with 0 < a < 6 < 1. The information represented by [a,b] 
can be seen as the information concerning a real value x: x € [a, 6]. The in- 
formation order C is then the reverse inclusion order :[a, 5] C [^^^ 

[a; G [a\b'] x G [«,&]] [a' ^b'] C [a, 6] a < a' and b' < b. There 

is a bottom element T in I[0, 1], that is an element below any other: T = [0, 1]. 

Formal definitions. Let us consider a partial order (P, C). A subset Z\ of P is 
said directed if an only if : 

Vx, y G A, 32 G A, x Q z and y Q z 
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With the interpretation of □ as an information order, a directed subset is a set 
of consistent values: given two values x and y of the subset, the information 
represented by x and y is consistent, since it is refined by a value 2 : in the set. 

A directed complete partial order, or dcpo is a partial order such that each 
directed set has a lower upper hound, or lub. With the interpretation of C as an 
information order, in a dcpo the information contained in any directed set can 
be represented by an element of the dcpo, the lub of the subset, dcpo’s are also 
called domains. 

There is a TO"^ topology associated to dcpo’s: the Scott topology. A subset O 
of a dcpo {D, C) is Scott open if and only if : 

— It is upward closed: x G O and x ^ y ^ y G O, 

— and if it contains the lub 5 of a directed subset A, it contains an element of 

A: S eo ^3x e A, x e O 

This topology defines continuous functions among dcpo’s or among dcpo’s and 
topological spaces. One can show that a map / between dcpo’s is continuous 
(that is Scott-continuous) if and only if / is monotonic and / preserves lub’s, 
that is, the lub of the image by / of a directed set A is the image by / of the 
lub of A. 

For example, one can check that the two examples above ({tt,ff, T} and 
I[0,1]) are dcpo’s. Open sets of {tt, ff, T} are 0, {tt}, {ff}, and {tt,ff, T}. 

The set of all the [a, b] intervals included in a given open® subset O of [0, 1] is 
a Scott open subset of I[0, 1] . The set of all such sets is a basis of the (Scott-)open 
sets of I[0, 1]. 

Given two elements x,y in a dcpo D, we say that x is way below y, or x 
approximates y, denoted by x <C j/ if, for any directed subset A with a lub 5\ 
y C 5 G A, X Q z. One can check that, in I[0, 1], [a,b] « [a',b'] 

[a' , h'\ c]a, h[ a < a' and b' < b. 

An element x of a dcpo D is said maximal if there is no other element above 
x: Wy G D,x Q y ^ X = y. For examples, the maximal elements of {tt,ff, T} 
are tt and ff. The maximal elements of I[0, 1] are the [x, x] intervals. The set of 
maximal elements of I[0, 1] can then be identified to [0, 1]. 

A continuous domain [7] is a dcpo {D, C) such that, for every element x G D: 

— the set A = {y e I y <C x} is directed, 

— and X is the lub of X. 

It can be shown that I[0, 1] is a continuous domain. A subset S of a continuous 
domain {D, C) is a basis of D if, for each element x of D: 

— the set Bx = {b G B\ 6 <C x} is directed, 

— and X is the lub of B^. 

^ A TO topology is a topology that satifies the weakest separation axiom. 

® Open for the topology induced by the standard topology of R. 
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An u- continuous domain is a continuous domain with a countable basis. An 
element of an w-continuous domain can be expressed as the lub of a sequence 
of basis elements approximating it. One can see the relation with the informal 
description of computation over uncountable sets given in section 2. I[0, 1] is an 
w-continuous domain. Examples of countable basis are the sets of intervals [p, q\ 
where p and q are rational or dyadic numbers. 

3.4 Computability over effectively given continuous domains 

Recently, w-continuous domains has been used to define computability structures 
[7,8]. 

We say that an w-continuous domain {D, C) with a basis B is effectively 
given with respect to a given enumeration of B = {bo, b\, &2, • • •}, with bo = _L, 
if {{i,j)\ bi bj} is an r.e. subset of N^. 

We say that an element a; of I? is computable if the set {i| bi <C x} is r.e.. 
This is equivalent to say that there exists a recursive function / such that the 
sequence of (6y(j))igN is increasing and x is the lub of the sequence. An element 
[a, b] of I[0, 1] is computable if and only if a is lower semi-computable and b is 
upper semi-computable. A maximal element [x, x\ is computable if and only if x 
is a computable real number. 

A map / between effectively given w-continuous domains D and E with 
the respective bases {do: di,d 2 ,- • and {eg, Ci, 62, • • •} is computable if the set 
{{i,j)\ Ci f{dj)} is an r.e. subset of N^. 

3.5 I[0, 1]" 

The interval domain I[0, 1]" of [0, 1]" is the set of all non-empty n-dimensional 
sub-rectangles in [0, 1]” ordered by reverse inclusion. It is an w-continuous do- 
main. A basic Scott open set is given, for every open subset O of R”, by the 
collection of all rectangles contained in O. The set of maximal elements is the set 
of {x} for X G [0, 1]”. The set of sub-rectangles with dyadic coordinates provides 
a countable basis of I[0, 1]". A basis element of I[0, 1]” may represent a voxel (or 
a rectangular set of voxels). 

4 The solid domain 

4.1 Continuity 

For any subset A of a topological space. A, A°, dA and A° denote respectively the 
closure, the interior, the boundary and the complement of A. The regularization 
of a subset A is defined, by Requicha [13,14], as the subset A°. We say that a 
set is regular if it is equal to its regularization. 

The solid domain S[0, 1]” of the unit cube [0, 1]” C R” is the set of ordered 
pairs (A,B) of compact subsets of [0,1]” with AU B = [0,1]” endowed with 
the information order: (Ai,Bi) C (A2,i?2) A2 C Ai and B 2 C B\. The 

elements of S[0, 1]” are called partial solids. 
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Proposition 1. (S[0, !]”,□) is a continuous domain. 

The meaning of the way below or approximation relation between partial solids 
is then given by the following proposition: 

Proposition 2. For any (A,B) and in S[0,1]” 

{A, B) < {A', B') ^ A' c and B' c B° 

The next proposition shows that the solid domain could be equivalently de- 
fined as the quotient of the power set of [0,1]" under the equivalence relation 
X = Y <1=^ {X° = Y° Sz X = Y) equiped with the corresponding ordering. 

Proposition 3. For any {A, B) e S[0, 1]”, there exists a subset Y of [0, 1]" such 
that: A = Y and B = Y‘^. 

One can take for example: Y = {A \ B) U dA U (Q" n (A n B)°). 

Given any subset X of [0, 1]”, the classical membership predicate G: [0, 1]” ^ 
{tt.ff} is continuous except on dX. It follows that the best continuous approx- 
imation of this predicate is g': [0,1]" ^ {tt,ff,T} where the value T is taken 
on dX (recall that any open set containing T contains the whole {tt, ff, T} set). 
Conversely, any continuous function from [0, 1]" to {tt, ff, T} is the membership 
predicate of a partial solid. The solid domain could then be equivalently defined 
as C([0, 1]", (tt, ff, T}), the set of continuous maps from [0, 1]" to {tt,ff, T} with 
the ordering induced by the order of {tt,ff, T} [5,6]. The following proposition 
connects the Solid domain to the notion of Requicha’s regular sets. 

Proposition 4. The maximal elements o/S[0, 1]" are precisely those that rep- 
resent regular sets. In other words, maximal elements are of the form {A, B) 
such that A and B are regular with B = A‘^ and A = B'^. 



Hausdorff and Lebesgues metrics. Two metrics over partial solids are used 
in subsequent sections. The Hausdorff metric dx between compact sets is a 
natural metric for the purpose of approximation of compact sets: recall that 
the set K([0, 1]") of compact subsets of [0,1]" equiped with dK is a complete 
metric space (that is, any Cauchy sequence is converging). The metric induced 
on S[0, 1]" as a subset of [K([0, 1]")]^: 

dniiA, B), {A', B')) = max{dK{A, A),dx{B, B')) 

is also called (by an abuse of language) the Hausdorff metric over S[0, 1]". Be- 
cause S[0, 1]" is a closed subset of [K([0, 1]")]^ for the topology defined by the 
Hausdorff metric, S[0, 1]" is a complete metric space for dn. 

Now, given a compact subset X of [0, 1]" , the characteristic function of X, 
that is the function whose value is one on X and zero elsewhere, is denoted "Xx- 
The norm of \x is the Lebesgues measure of X. The following semi-metric 
dr, defined by 

dL{X,Y)=\\ Xx-Xy 111 
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is the Lebesgues measure of the set theoretic symetric difference between X and 
Y. It is not a metric as the Lebesgues semi-metric between two compact sets that 
differ by a zero measure set is zero. This semi-metric induces a semi-metric on 
S[0, 1]” that is called Lebesgues semi-metric here. It defines of course a metric 
on the associated quotient set. 



Boolean operators and generalized membership predicate. The truth 
tables of logical Boolean operators on {tt, ff, T} give us the definition of Boolean 
operators on the solid domain as we want to keep the natural definition of union: 
X & X yjY X € X or X G Y . Applying the same rule for intersection and 

complement give the definition of Boolean operators on partial solids: 

{Ai, Bi) LI {A2, B2) = (Ai U A2, i?i n B2) 

(Ai, i?i) n (A2, B2) = (Ai n A2, u S2). 

{A,Br = (B,A). 

The union operator over K([0,1]") is continuous for the Hausdorff metric 
while the intersection operator is not. The union and intersection operators are 
continuous (even Lipchitzian) for the Lebesgues metric over (the quotient of) 

K([0,1]”). 

We have then the following properies: 

Proposition 5. The U and n operators over S[0, 1]” are not continuous for the 
Hausdorff metric. 



Proposition 6. The U and n operators over the quotient o/S[0, 1]" are contin- 
uous for the Lebesgues metric. 

We have defined the continuous membership predicate for points of [0,1]". In 
order to be able to compute this predicate, we extend it to the interval domain 
I[0, 1]" by defining €: I[0, 1]" x S[0, 1]" ^ {tt, ff, T} with: 

r tt if A C 
G (A,(A,B)) = <^ ff if A C 

I T otherwise 



Proposition 7. The following maps are continuous for the Scott topology, in 
other words they are monotonic and preserve lower upper hounds: 

G: l[0,lj" X S[0,lj" ^ {tt,ff,T} 

U : S[0,lj" X S[0,1]" ^ S[0,lj" 
n : S[0,lj" X S[0,lj" -> S[0,lj" 
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4.2 Computability 

In this section, we introduce three different notions of computable partial solids 
related to the topologies defined in the previous sections. These three notions of 
computable partial solids lead to corresponding notions of computable operators 
over partial solids. 

To define computability, we need a countable basis of S[0, 1]". A dyadic voxels 
set is a finite union of cubes, each the product of n intervals whose end points are 
dyadic numbers. A partial dyadic voxels set (pdvs) is an element (A, B) G S[0, 1]” 
such that A and B are dyadic voxels sets. PDVS stands for the set of pdvs’s. 
The set PDVS is effectively enumerable. This means that each pdvs can be 
represented by a finite word and there exists a program able to compute a one- 
to-one correspondance between N and PDVS so that one can denote by pdvsi 
the pdvs and PDVS = {pdvsi,i G N}. 

Every compact subset of [0, 1]" is the intersection of all the dyadic voxels set 
containing it. It follows that every partial solid is the least upper bound of all 
the pdvs below it. In other words, we have the following proposition: 

Proposition 8. PDVS is a countable basis of the solid domain S[0, 1]”. 

Moreover, there exists a program able to decide, given two integers i and j as 
input, if pdvSi <C pdvsj or not. This means that the set {(z, j)| pdvSi <C pdvsj} 
is a recursive subset of N^. It is then a fortiori recursively enumerable. This last 
statement means the following: 

Proposition 9. The solid domain is effectively given with respect to the enu- 
meration {pdvsi,i G N} o/PDVS. 

It is now possible to introduce the three notions of computability. A partial 
solid (A, B) is said computable if it is computable according to the general defi- 
nition given in section 3 for maps between effectively given continuous domains, 
that is, if the set {k\pdvsk (A,B)} is recursively enumerable. 

It is said recursive if the set {k\pdvsk {A,B)} is recursive. 

It is said Lebesgues computable if it is computable and the Lebesgues measures 
of A and B are computable real numbers. 

In [3], several related notions of computability for compact sets are given in 
the TTE framework. (A,B) is a computable partial solid if A and B are co- 
r.e. compact sets according to the terminology of [3], and {A,B) is recursive 
if A and B are recursive compact sets according to the terminology of [3]. The 
following propositions state other equivalent definitions for computable, recursive 
and Lebesgues computable partial solids. Herefter, pdvSi, the z*^ pdvs, is denoted 
{Ai, Bi). 

Proposition 10. The following statements are equivalents: 

— The partial solid {A, B) is computable. 

— There exists a recursive function f such that A = r)kAf(^k)> B = C\kBf(^k)- 

— There exists a recursive function g such that A = C^kAg^k), B = C\kBg(k) ozzd 
the sequence of {Ag(^k)i Bgf^k)) is increasing for the information order C. 
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In the situation of the third statement of the last proposition, the sequence 
(Ag(fc), converges toward {A, B) for the Hausdorff metric. The proposition 

below (that can be shown using [3]) states that this convergence is effective if 
and only if (A, B) is recursive. 

Proposition 11. The following statements are equivalents: 

— The partial solid {A, B) is recursive. 

— There exists a recursive function f such that A = r]kAf(i.), B = C\kBf(u\ 
and dH{{A,B),{Af^,pBf^,^))<2-K 

We have a similar proposition for Lebesgues comput at ability. Here, for a compact 
subset X of [0, 1]", p,{X) denotes the Lebesgues measure of X. 

Proposition 12. The following statements are equivalents: 

— The partial solid {A, B) is Lebesgues computable. 

— There exists a recursive function f such that A = B = C\kBfi^k) 

and the sequence converges effectively toward (A,B) for the 

Lebesgues (semi) -metric. 

— There exists a recursive function g such that A = rifcAg(fc), B = C\kBg(^k) and 

^i{Ag(^k) n Hg(fc)) - ^i{A nB)< 2-^ 



Computability of the boolean operators and the membership predi- 
cate. From the effectively enumerated basis {pdvsi,i G N} of S[0, 1]" one can 
define an effectively enumerated basis of the product domain S[0, 1]" x S[0, 1]” 
that is then an effectively given domain. 

Proposition 13. Using the effectively enumerated bases given above, the U and 
n operators are computable maps from S[0, 1]" x S[0, 1]" to S[0, 1]". 

In other words, there exists a program that, given any pair of computable 
partial solids given by the computable sequences of pdvs approximating them, 
computes a sequence of pdvs’s approximating their union (respectively their 
intersection). Because each element of the output sequence is computed in a 
finite time, each of these computations use finitely many elements of the input 
sequences. 

Proposition 14. Using the effectively enumerated bases given above, the G pred- 
icate is a computable map from I[0, 1]” x S[0, 1]” to S[0, 1]”. 

From the fact that boolean operators are not continuous for the Hausdorf 
metric, it follows that they can not satisfy the corresponding notion of com- 
putability. Moreover, we have the following: 

Proposition 15. They are pairs of recursive partial solids whose union (respec- 
tively intersection) is not recursive. 
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This means that the set of recursive solids is not closed under boolean operators. 
On another hand, the information given by the computable sequence defining a 
computable solid is rather weak as, given any finite part of the sequence, one do 
not know how far is the actual solid. 

However, for Lebesgues computable partial solid there is a computable se- 
quence that approximate the partial solid up to any given accuracy for the 
Lebesgues semi-metric. 

Fortunately, we have: 

Proposition 16. Boolean operators are Lebesgues computable. 

In other words, there exists a program that, given any pair of Lebesgues com- 
putable partial solids given by the computable sequences of pdvs’s approximating 
them and effectively converging for the Lebesgues metric, computes a sequence 
of pdvs’s approximating their union (respectively their intersection) effectively 
converging for the Lebesgues metric. The computation of each element of the 
output sequence use finitely many elements of the input sequences. The sec- 
ond and third statements of proposition 12 give examples of a such sequences of 
pdvs’s approximating a partial solid and effectively converging for the Lebesgues 
metric. 



5 Applications and further developpements 

The primary goal of this approach is the design of robust geometric algorithms. 
Robustness issues show up when the values of the logical predicates evaluated 
from numerical computations are inconsistent, resulting in an invalid output or 
the catastrophic failure of the algorithm. 

Arithmethics for real numbers data type hyde the uncountable nature of com- 
putations as far as one deals with continuous maps and operators. Evaluation of 
geometric predicates, as the point membership to a solid represented by its ex- 
plicit boundary or by an implicit equation, is a typical example of discontinuous 
map. In order to make the mathematical model consistent with a realistic com- 
putation, domain theoretic approach considers continuous predicates valued in 
the boolean domain {tt, fF, T}. See [5] for an application to a completely specified 
and robust disk-disk intersection algorithm. 

The partial solid model may also help to design realistic specifications of 
algorithms computing boolean operators over B.Rep® models. For example, it 
seems to be the right framework to define computation over subsets of [0, 1]” 
defined by a implicit equation as: 

E = {x€[0,ir\ /(x) <0} 

For a computable continuous function / from [0,1]" to R. 

® B.Rep stand for Boundary Representation. 
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Abstract. Consider a finite non-vertical, and non-degenerate straight- 
line segment s = [so,si] in the Euclidian plane We give a method 
for constructing the boundary of the upper convex hull of all the points 
with integral coordinates below (or on) s, with abscissa in [a;(so), a;(si)]. 
The algorithm takes O(logn) time, if n is the length of the segment. We 
next show how to perform a similar construction in the case where s is 
a finite, non-degenerate, convex arc on a quadric curve. The associated 
method runs in 0{k logn), where n is the arc’s length and k the number 
of vertices on the boundary of the resulting hull. This method may also 
be used for a line segment; in this case, k — O(logn), and the second 
method takes 0{k^) time, compared with 0{k) for the first. 



1 Upper Hull of Grid Points below a Line Segment 

This paper will consider integral hulls, i.e. convex hulls of set of points with 
integral coordinates in the Euclidean plane (grid points) . A paper on 3D lattice 
convex hulls is Reveilles and Yaacoub’s [RY95]. 

This section describes a method for computing the boundary of the upper 
convex hull of all the grid points located below a given non-degenerate, non- 
vertical, line segment s (including those possibly on it). Obviously, only the upper 
hull boundary requires some computation. Hence, we shall use the terms “hull” 
and “upper hull” indifferently in the sequel, unless where specified otherwise. 

The supporting line of the segment has equation y = ax + b, and is unam- 
biguously described by a and b, since s is supposed not to be vertical; typically, 
these coefficients are rational numbers, although the method also applies to any 
computable field, i.e. a field in which the usual exact arithmetic operations (sum, 
difference, product, division, sign, floor) are available. 

Convention: We assume w.l.o.g. that the slope of s is non-negative (a > 

0): this allows a shorter presentation, and the other situation (a < 0) is deduced 
by symmetry. 

Since we want the convex hull of the integral points below s, we may restrict 
the problem to a segment the endpoints of which have integral abscisstE, say 
xo = [a;(so)l G ^ and xi = [x(si)J G Z. Such a segment is the hypothenuse of 
the rectangle triangle with vertices (xg, axo + b), {x\, axo + b), and (xi, axi +b). 
Let n = min(a:i — xq, [a(xi — a;0)J ) be the length (in number of unit intervals) of 
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the shortest edge in this triangle; then the brute-force method to compute the 
upper hull boundary takes 0(n) time. We summarize this method in sub-section 
2, where it is used as a subroutine of our own algorithm, which we now detail. 

1.1 Constructing the Upper Hull Boundary 

The method we propose runs in O(logn) time. It proceeds in two steps: 

1. The first step consists of computing the boundary of a prehull, which yields 
a superset - actually a superlist - of the final upper hull boundary. It is a 
super list because it generally contains: 

— points aligned with extremal vertices of the final hull, but that are not 
part of it^, 

~ but also some points that were previously found to lie on the convex hull 
boundary, but do not any more, due to concatenation (see below). 

2. The second step uses the brute-force method to scan the prehull boundary, 
and remove from it those points that are not part of the final list, in time 
proportional to the size of the boundary of the prehull (and, incidentally, 
that of the final upper hull boundary), i.e. O(logn). 

The First Step. 



Some notations. We define A = {xo,axo + b),B = (xi > xo,axi + b). AB 
is the segment. Set yo = [aa:o -I- b\, and let C = (xo,yo) be the grid point just 
below A (or equal to A when A is a grid point). We also need D = {x\, yo)- 
ABCD is called the “reference trapezoid” of segment AB. The trapezoid is 
said to be vertical when the angle between CD and AB is superior or equal to 
45 degrees (a > 1) (see Fig. 2) and horizontal otherwise (see Fig. 3 and 4). 



A 


• • • ^ • 


• 


c 

• • • • • 


« V w u W 

• • • • • 



Fig. 1. When [axo -I- 6J = \ax\ + b \ , the upper hull boundary is obviously the 
segment from vertex (a;o, [axo -I- 6J) to vertex (x\, [axi + 6J). 



If the segment is almost horizontal (see Fig. 1), i.e. if [axo + b\ = [axi -I- 6J, 
then the upper hull (and prehull) boundary is segment CD. 

^ If Hi, H 2 , . . . Hn is a maximal chain of aligned points of the boundary of a convex 
hull, only Hi and Hn are extremal vertices: a vertex on the boundary of the convex 
hull is either an extremity, or it is common to two adjacent segments with different 
slopes. Thus, for instance, the hull of the segment between point O = (0, 0) and 
M = (1000, 1000) only has two extremal vertices {O and M), among all the 1001 
integral points that the full segment comprises. 
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Otherwise, we distinguish two cases, depending on whether the reference 
trapezoid is vertical or horizontal, making use of the following lemma, that we 
give without proof: 

Lemma 1. Let A be a closed set in E^, and U an unimodular transformation 
o/E^. Let H{) denote the convex hull of a subset in the plane. Then 

U-\H{U{A))) = H{A). 

We recall that the restriction to 1? of an unimodular transformation of E^ 
is an affine bijection of as a consequence, the associated matrix has integer 
entries and determinant ±1. Geometrically, such transformations preserve areas, 
grid points, alignments, and convexity. Lemma 1 will be applied to reference 
trapezoids. 

Constructing the convex hull of A may be done by applying the reciprocal 
(C/“^) to the convex hull of the image by U of A. The principle of the algorithm 
will be to reduce the original problem to one with smaller size, by means of a 
well-chosen unimodular transformation, constructing the upper hull boundary 
of the image, and then deducing the original upper convex hull by inverse image. 
Let us now see how we can change a vertical trapezoid into an horizontal one, 
and vice versa. 




Fig. 2. A vertical reference trapezoid. 



Vertical trapezoid. Since we have assumed, w.l.o.g., that a > 0, we necessarily 
have a > 1 (see Fig. 2). We reduce this case to the next (0 < a < 1) by means 
of a unimodular transformation. 

Let T = {xt = xi,yT = J/o + (a^i ~ a^o)L®J)> i-®- ^ is the grid point below B 
such that CT has slope cr = [aj . 
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Consider that C is the origin of our coordinate system, and consider the 
following transformation U : (x,y) defined by 

x' = x 
y = y — ax 

Transformation U is obviously unimodular, leaves C invariant, and maps T to 
T' = D (see Fig. 2), and more generally vertically shifts vertical lines propor- 
tionally to their distance to the vertical line passing through C, so as to make 
line CT horizontal. 

Since A'B' has slope (a— [aj) less than 1, its reference trapezoid A'B'C'T' is 
horizontal, has smaller size than ABCD (the original trapezoid for segment AB), 
and an upper hull boundary with similar characteristics (number of vertices) to 
the ones of ABCD, due to Lemma 1. 

Hence, we only need to construct the upper prehull boundary for segment 
A'B' , and then deduce that for segment AB by applying U~^. This takes us to 
the second case. 

Horizontal trapezoid. Here, we necessarily have a < 1 (see Figure 3 and 4). 
The initial trapezoid ABCD contains a smaller vertical one, A2B2C2D2, with 
inverted axis^, which we obtain as follows: 

— B2 is the intersection point between AB and the horizontal line with equation 
y = 2/C + 1; 

— A2 is the intersection point between AB and the horizontal line with equation 

y = [vb\ ■ 

— C2 is the grid point just to the right of A2 (or A2 if A2 is a grid point) . 

— D2 follows in a straightforward fashion. 



• • • • »B 

A2 

• • «K 




Fig. 3. An horizontal reference trapezoid. 



Note that the left convex prehull boundary, /12, of the grid points located to 
the right of segment A2B2 may be computed, after some rotation and symmetry, 
with our method, by a recursive call. Now, the prehull boundary of AB is the 

By this, we mean that C 2 D 2 is vertical, while CD was horizontal. 



2 
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Fig. 4. An horizontal reference trapezoid. Note that some vertices on the upper 
hull boundary of A2B2C2D2 do not belong to that of ABCD. 



concatenation of: grid point C, /12, and grid point K = {x\, \ax\ + 6J), unless 
C2 = K. This should be obvious from Fig. 3 and 4. As mentioned earlier, note 
that some points originally on /12 may become non-extremal after concatenation 
with C, as in Fig. 4. This justifies the existence of a second step, to remove 
possibly invalid vertices. 

In some cases (see Fig. 5), we can reduce an horizontal trapezoid to a vertical 
one not completely contained in the horizontal one. The condition is that the tri- 
angle I AC does not contain grid points other than C, where I is the intersection 
point between line AB and line CD. In all such cases, B2 = I. 




• • • 

A2 

• 

• • * * 

* * * * 

• • — i • I) 

D2 




B K D 



Fig. 5. Refining the method. 



Running time of the first step. A vertical trapezoid with (integral) basis 
w and height h is reduced to an horizontal one with basis w and height h' = 
h mod w. An horizontal trapezoid with basis w and height h is reduced to a 
vertical trapezoid with basis w' < h and height h' < w. Thus, there are at most 
0(log(max(rc, h))) such reductions. 
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Similarly, the upper prehull boundary of a vertical trapezoid has as many 
points as that of the horizontal trapezoid it is reduced to. The upper prehull 
boundary of an horizontal trapezoid has at most 2 (thus 0(1)) vertices more 
than that of the vertical trapezoid it is reduced to. 

In the basic case which stops recursion, the upper prehull boundary has 2 
vertices. Thus the upper prehull boundary of a trapezoid with size n = max(t(;, h) 
has O(logn) vertices. Since the prehull is a superset of the hull, the latter also 
has O(logn) vertices. 

The hull may have O(logn) vertices in the worst case: as is well known, 
for a segment from (0,0) excluded to (q,p) G the vertices (x,y) of the 
upper hull of integral points {0 < x < q,y < pxjq) below the segment, and 
the lower hull of integral points (0 < x < q,y > px/q) above the segment, 
are given by the successive convergents | of the continued fraction expansion 
(CFE) of the rational slope p/q (Klein’s theorem). The smallest inputs (q,p) 
for which there are k convergents (thus k vertices in both hulls) are the two 
successive Fibonacci numbers q = Fk and p = Fk-i. We remind the reader 
that Fibonacci numbers are defined by : Fq = 0, Fi = 1, Fk = Fk-i + Fk- 2 , 
that Fk = (<?'= - F'=)/v^ = where <l> = (1 + v^)/2 « 1.618 ... is the 

golden number, and ^' = (1 — •\/5)/2 « —0.618... is its conjugate, and that 
k = [log,|i Ffc-\/5] . The notation [n] stands for n rounded to the nearest integer. 
See [GKP89] for a more detailed account. 

Of course, we want the running time of the first step to be O(logn), which 
takes a little effort: we can reach this complexity O(logn) if the function prehull (), 
after each of its O(logn) recursive calls, only performs 0(1) operations. Thus 
the function prehull () is typically allowed to append or prepend two new ver- 
tices to the list resulting from the current recursive call (i.e. the upper prehull 
boundary F), but not to apply any kind of linear transformation to the O(logn) 
vertices of h' . 

A solution is to let function prehull () receive h' in the correct coordinate 
system, as a result of its recursive call. Thus the coordinate system should be 
passed to it as an argument {via some transformation matrix, say). Function 
prehull 0 then uses this matrix to transform the 0(1) vertices it adds to the 
current upper prehull boundary, thus only performing 0(1) operations in addi- 
tion to its recursive call. 

This method has some similarity with Euclid’s algorithm ([GKP89]), and 
may be interestingly compared to the algorithm proposed by N. Kanamaru, T. 
Nishizeki and T. Asano ([KNA94]), for counting the number (or giving the list) 
of all the grid points inside a given triangle. 



The Second Step. Here, the upper prehull boundary is a list of grid points, 
ordered by increasing abscissae. This step removes those grid points it contains 
that are not vertices of the upper hull, using a technique akin to the well-known 
Graham scan ([PS85]) for constructing the convex hull of a set of points in the 
plane, arranged in some compatible order: 
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Scan the list : 

Let q be the current point, p its predecessor, 
and r its successor in the list. 

If pqr forms a strictly convex angle: 

Step forward: 

Set q <— r (and update p and r) . 

Otherwise : 

Remove q from the list, and 
Step back: 

Set q *— p (and update p and r) . 



This simplified presentation ignores special cases occurring at the head or at 
the tail of the list, but it is enough to assume pqr to be declared convex when 
either p or r does not exist. 

The method runs in time linear in the size of the list, k: there are at most 
0{k) back steps, since each one of them removes an item from the original list; 
and the number of forward steps cannot exceed k plus the number of back steps. 

2 Upper Convex Hull of Grid Points below a Convex 
Curve 

2.1 The Problem 

This section describes a method for computing the upper convex hull boundary of 
all the grid points below a given finite, convex and bi-monotonic arc of a quadric 
(parabola, hyperbola, ellipse), including those possibly on it. Bi-monotonic arcs 
have at most one intersection point with any line parallel to either of the two 
prescribed perpendicular directions (say Ox and Oy). From now on, we shall 
only consider arcs which are monotonic with respect to Ox and Oy: all quadrics 
may be partitionned in such arcs in constant time. 

Arcs are defined by the integral-coefficient equation of the quadric, F(x, y) = 
ax"^ + by^ + cxy + dx + ey + f = 0, and by the linear inequalities which their 
points must satisfy. For instance, the maximal arc of quadric where points have 
normal oriented towards the north-east quadrant carries inequalities: < 0, 

and Fy > 0 where F^{x, y) = 2ax + cy + d and Fy{x, y) = 2by + cx + e are the 
derivatives of F' in cc and y, respectively. 

Note that, in this formulation, the (possibly non rational) coordinates of arcs 
endpoints need neither be explicitly (i.e. with numbers), nor exactly represented. 



2.2 Constructing the Upper Hull Boundary 

We suppose that a method to compute intersection points between the arc at 
hand and any ray is available. A ray is a half straight line, defined by an origin 
(xo,?/o)) and a direction vector (u,v); in our setting, xg, yo, u and v are integers 
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(incidentally, the last two are supposed to be coprimes). Points (x,y) on the ray 
can be parameterized hy t: x = xq + ut, y = yo + vt. Substituting x and y in 
the quadric’s equation F{x,y) = 0, we get a quadratic equation E{t) = 0 with 
integral coefficients, which is (almost always, see Section 2.4) trivially solved. 
Only intersection points which satisfy the arc’s inequalities are kept. 

To compute the upper hull, we walk along the arc, starting from a known 
point S of the hull, say its leftmost point. The principle is to maintain a basis 
(/, J) of two vectors (see Fig. 6) such that: 

— I = (Ix, ly) and J = {Jx, Jy) have integral coordinates, 

— their determinant is equal to +1: 



Jx Jy 



+1 



This unimodularity condition guarantees that all grid points have integer 
coordinates relatively to the coordinates system (S,I,J), and conversely, 
all points with integral coordinates relatively to {S, I, J) are grid points. 
Moreover, the segment [S', S + J[ is cut by the curve, and will always be as 
we follow the curve. 

~ The last constraint on (S, I, J) is that the next vertex (T, to be computed) 
after S on the convex hull boundary has non-negative integral coordinates 
relatively to (S, I, J). 




Fig. 6. On the left handside, basis (/, J) is the initial one (/q, Jq). We are in the 
second case (as defined below), and the new basis (right handside diagram) is 
r = I + J, J' = I + 2 J. In both diagrams, we have drawn the lattice generated 
by translating the elementary unit “cell” (shadowed). The set of all vertices in 
each lattice coincides with Z^, thanks to unimodularity. 
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For instance, if the arc to be followed has normal oriented towards North- 
East, then the initial values for / and J are /q = (1,0) and Jq = (0,1); other 
cases are found by symmetry. 

The method computes the next point T after S on the convex hull; then it 
jumps to T, i.e. T becomes the new origin S, and the basis (/, J) is reset to 
{Iq, Jq) - a constant for a given arc. 

It remains to explain how the basic step, NextPoint(arc, S, I, J) computes 

T. We shall use the following notations: if P is a point and V a vector, P -|- E is 
the point Q such that PQ = V; Ray(P, U) is the ray with origin P and direction 

U. 

There are several possible configurations for the arc S, I, and J, as discussed 
below. First note that, for the sake of clarity, figures use an orthonormal basis (/, 
J), though actually only the initial basis (/q, Jq) is orthonormal, (with the excep- 
tion of Fig. 6 where we have represented the process “as is”, for a comparison). 
This convention is justified by the fact that the transformation applied to (/, J) 
to get “orthonormal figures” is unimodular, i.e. it preserves area, alignments and 
grid points. 



Case 1. In case 1 (see Fig. 7, 8, and 9), the arc does not cross ray Ray(5' -I- J, 
I), or crosses it twice (possibly tangentially) inside the interval (S'-!- J, S'-!- J-|- /[. 

Then, if the arc crosses Ray(S'-|-/, I) at point P, we are in case 1.1 (Fig. 7); T is 
known and it is the grid point just before P, i.e. if P has coordinates {Px, Py = 0) 
in coordinate system (S,I,J), then T has coordinates {Tx = [Px\,Ty = 0); this 
should be obvious on Fig. 7. 

Otherwise, the arc does not cross Ray (S' -I- /, I). We are either in case 1.2 or 

1.3. 

We are in case 1.2 (see Fig. 8) if we have reached the arc’s end: we clip Ray(S, 
I) by all linear inequalities which are part of the arc’s definition. Let P be the 
right endpoint of the clipped ray. Then T has coordinates {Tx = [Px\,Ty = 0); 
this should be obvious on Fig. 8. 

We are in case 1.3 (see Fig. 9) if the arc is tangent to an asymptote parallel 
to /, and if we accept as input arcs with infinite length; here T is the point 
at infinity (-|-oo,0). This kind of case occurs for instance with the hyperbola: 
0 < x, 0 < y, cry — 1 > 0. Of course, there is the risk that the upper convex hull 
boundary of infinite arcs has an infinite number of vertices, and in such a case, 
the method never stops! Thus, we mention case 1.3 only for completeness: it is 
a priori impossible. 

Thus, in case 1, the method terminates and returns T. 




Fig. 7. Case 1.1. 
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Fig. 8. Case 1.2. 



4 



* ^ 

S I 

Fig. 9. Case 1.2. 



Case 2. In case 2 (see Fig. 10), the arc crosses Ray(S' + J,I) once, and the 
intersection point belongs to [S + J, S + J + /[. Due to monotonicity, either the 
arc cuts Ray(S' + I, J) in one point P and we are in case 2.1, or it terminates 
before and we are in case 2.2. A case similar to case 1.3, with an asymptote 
parallel to J, is impossible. 

In case 2.1, P exists and has coordinates {P^ = ^,Py) in (S,I,J). Let 
b = [Py\, I' = I + bJ, J' = r + J. Note that (S', J') fulfills the pre- 
scribed requirements (unimodularity, etc). Then we recursively compute T = 
NextPoint(arc, S, I', J'). 




Fig. 10. Case 2. Left: case 2.1. Right: case 2.2. 



In case 2.2, Ray(S + /, J) is clipped by all linear inequalities of the arc’s defi- 
nition. Let P be the highest endpoint of the clipped ray. Then T has coordinates 
(T, = 1,T,= LP,J). 

Case 3. In case 3 (see Fig. 11), the arc crosses Ray(S -I- J,I) at least once, 
but the intersection point closest to S -I- J (call it P = {Px,Py = 1)) does not 
belong to [S -I- J, S -I- J -I- /[. Then let a = \Px], P = al + J, J' = I' — I; 
again (S, P, J') fulfills the prescribed requirements (unimodularity, etc), and we 
compute recursively T = NextPoint(arc,S', J'). 
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Fig. 11. Case 3. 



2.3 Justification and Running Time 

That we obtain the upper convex hull in the fashion described above is inherently 
justified by the fact that the curve arc is convex and we always select the next 
local integral maximum under the arc. 

Indeed, method NextPoint(arc, S, I, J) basically mimics the Continued Frac- 
tion Expansion of the rational slope | of segment ST, and hence selects the best 
possible approximations in increasing order. More precisely, method NextPoint 
starts with basis Iq, Jo, and, in the worst case, either transforms J„ into 



J -^n+1 — Jn 

\ Jn-\-l ~ 


(2) 


in case 2, or into 




J -^n + l — + Jji 

\ ~ Jn 


(3) 


in case 3 (in all other cases, it terminates). At step k, Ik = {xk yk) and Jk = 
{xk Vk) have coordinates Xk, yk, x'f., j/(, superior or equal to 2*“^ (proof by 



recurrence). After a logarithmic number of steps, Ik length becomes greater 
than the arc length, and method NextPoint terminates. 

This implies that method NextPoint performs 0(log(max(p, q))) operations; 
here the most expensive operation is the resolution of a quadratic equation with 
integral coefficients. It is usually assumed (although debatable) that such an 
operation takes constant time, p and q are bounded by the length (say n) of the 
minimal rectangle enclosing the arc. Thus NextPoint runs in O(logn), and is 
called 0{k) times if there are k vertices on the hull boundary. 

Hence, the method runs in O(fclogn) time. It may be used on a straight line 
segment: in this case, k = O(logn), and it runs in 0{k'^) time, compared to 0{k) 
for the method of the previous section, designed for straight lines only. 

The running time O(fclogn) may perhaps be improved: 

— First, the previous running time analysis was a bit pessimistic to remain 
simple. More optimistically, if (^) is the sequence of slopes along the hull, 
then the number of operations is 

fc-i 

^log(max(ai,&i)). 

2 = 1 
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— Then the method itself may perhaps be improved: at each vertex, it starts 
with basis I, J equal to /q, Jq, the same initial basis for all arcs; we do not 
use the fact that the slope of SkSk+i is likely to be close to that of Sk-iSk, 
i.e. the first reduced fractions of these two slopes are equal. For the time 
being, we have not investigated such tracks. 



2.4 Accuracy Problems 

A straightforward implementation using floating-point arithmetic faces difficul- 
ties due to inaccuracy. Typically, the program will loop at some hull vertices: 
solving a second degree equation is not so trivial!. . . 

Basically, we need to solve quadratic equations E{t) = 0, with integral coef- 
ficients, in such a way that if a root is an integer, we must compute it exactly, 
but otherwise, it is enough to know that it belongs to some interval ]z,z + 1[ 
with z G Z. This is easily done. 

Integers on 32 bits may be not sufficient: a Biginteger library should be used 
to prevent overflows (provoked, among other things, by the repeated application 
of unimodular transformations) . The bad news is that basic operations no longer 
take constant time. The good news is that an integer arithmetic is sufficient: no 
(very time consuming) quadratic arithmetic (providing the exact square root to- 
gether with the usual exact operations -k, — , x, /, and comparisons) is required. 



2.5 Number of Vertices 

Our method is 0{klogn), the brute force one 0{n) : clearly our method is of 
interest only if k log n < n ^ k < . Thus the question arises to bound k in 

function of n. 

We are still working on this subject. This section only gives measures of k for 
quarter circles with variable radius r. Figure 12 shows a log-log diagram with k 
in ordinate and r in abscissa. Circles have equation: x'^ + < r^. Tested radii 

are: r = 2, 4, 8 . . . 2^^. The arc is limited by: a; > 0, t/ > 0. For r large enough, say 
r > 128, points (r, fc(r)) are roughly aligned, so that k « 0.86r° ®®®. Of course, 
it is an empirical statement, not a theorem. 

Asymptotically, 0(n° ®®®) is smaller than 0(j^^), so for large enough radius 
values, our walking method will perform faster than the brute force one. 

2.6 Extension to other Curves 

As a matter of fact, our solution applies to any convex curve arc, provided 
a method to compute the intersection points between the arc and any ray 
(x{t),y{t)) is available. In the case of an algebraic curve of degree d with in- 
tegral coefficients, computing such an intersection reduces to finding the roots 
of a polynomial in t with degree d. As is already the case for quadrics, it suffices 
to know when roots are integral, and otherwise to detect an isolating interval 
not containing integers. 
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Fig. 12. This diagram shows the number k of vertices in upper hulls of grid 
points below quarter circles with various radii r = 2, 4, 8, 16 . . . 16384. Scales are 
logarithmic. 



The difficulty is rather in characterizing arcs. A possibility is to partition 
the curve into arcs with Collins’s cylindric algebraic decomposition algorithm. 
However, the preprocessing step is no longer constant, as it depends on the 
algebraic degree of the curve. 

We have not investigated quadrics with algebraic but not integral coefficients; 
however, such quadrics are subsets of algebraic curves with integral coefficients 
and higher degrees, that elimination methods (e.g. resultants or Grdbner bases) 
enable to compute; for instance, the line with equation x — \/2y = 0 is a subset of 
— 2y^ = 0, a quadric with only integral coefficients; thus its upper convex hull 
boundaries may be computed with our method for quadrics, using only integer 
arithmetic; they can also be computed with the method for line segments, but 
some arithmetic in the quadratic field Q[-\/2] is required. We won’t go into such 
subtleties, due to lack of space. 

3 Conclusion 

This paper has presented methods for computing upper convex hulls of grid 
points below a segment or an arc. We are currently investigating the possibility 
of improving these methods, by exploiting geometric properties of such hulls, like 
the presence of periodic patterns (i.e. automorphisms). This question is related 
to Number Theory, especially to quadratic number fields, their units and the 
Pell-Fermat equation. 
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Rounding Voronoi Diagram* 
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Abstract. Computational geometry classically assumes real-number ari- 
thmetic which does not exist in actual computers. A solution consists in 
using integer coordinates for data and exact arithmetic for computations. 
This approach implies that if the results of an algorithm are the input 
of another, these results must be rounded to match this hypothesis of 
integer coordinates. In this paper, we treat the case of two-dimensional 
Voronoi diagrams and are interested in rounding the Voronoi vertices 
to grid points while interesting properties of the Voronoi diagram are 
preserved. These properties are the planarity of the embedding and the 
convexity of the cells. We give a condition on the grid size to ensure 
that rounding to the nearest grid point preserves the properties. We also 
present heuristics to round vertices (not to the nearest grid point) and 
preserve these properties. 

Keywords: geometric computing, Voronoi diagram, integer coordinates, 
exact computations 



1 Introduction 

Theoretical computational geometry often assumed real-number arithmetic to 
model computations. In practice, this model is not implementable on an actual 
computer and the use of floating point computation as an approximation of real 
arithmetic is well known to yield difficult precision problems in the implementa- 
tion of geometric algorithms. Recent trends in computational geometry consist 
in using integer coordinates to represent the data, and exact integer arithmetic 
to make the computations [4,5,12]. 

Exact output: A solution to solve precision problems consists in using some exact 
representation for the input and exact computations to take decisions inside 
the algorithm. For example, if point coordinates are 6-bits integers, then the 
intersection of line segments can be solved using an exact 26-bits arithmetic [3] . 
Unfortunately, this approach works only for a single algorithm. If two algorithms 
have to be cascaded and the output of one must be the input of another, then 
we need an exact representation of the output. In the example of intersection of 
line segments, rational numbers with 26-bits (numerator and denominator) are 
needed to represent intersection points coordinates. If several algorithms have to 
be cascaded this approach yields an unacceptable increase in the precision used 
to store results. 

* This work was partially supported by ESPRIT LTR GALIA 
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Rounding: The alternative approach consists in rounding the result of an algo- 
rithm before starting the next one. Rounding means to move the exact results 
to some fixed sized integer representation. If the results consist in points, they 
must be moved to coincide with vertices of a grid. In that case, a brute force 
rounding may alter the properties of the result, for example a convex polygon 
may loose its convexity after rounding, and the second algorithm may not work 
on the rounded result; thus it is necessary to define rounding methods preserving 
the geometrical properties of objects. The rounding of geometric structures is a 
new concern in the domain, to the knowledge of the authors, previous works deal 
only with the rounding of arrangement of line segments [8,7] and arrangement 
of triangles in 3D [6] . These works use snap rounding: a point is rounded to its 
nearest grid point. 

Voronoi: In this paper, we address the particular problem of rounding the 
Voronoi diagram of a set of points S in the plane (see Definition 1). An ex- 
ample of cascading algorithms may be found in 3D reconstruction, where we 
need to compute the overlap of two Voronoi diagrams [1,2]. Given a diagram 
(a graph embedded in the plane), we will say that this diagram is planar if its 
edges do not cross and convex if all cells are convex. It is well known that a 
Voronoi diagram is planar and convex and there exist algorithms to compute 
the overlay of two convex planar diagrams, but convexity and planarity may not 
be preserved if the vertices of the Voronoi diagram are snap-rounded to grid 
points. In Figure 1, the rounded diagram (in dashed lines) has a non convex cell 
(the shaded cell) . In this paper we will investigate on which conditions the snap 
rounding of a Voronoi diagram will preserve planarity and convexity. We will 
also develop other ways of rounding than snap rounding. 




Fig. 1. Rounding of Voronoi diagram 



Counter example: A good rounding does not exist in the worst case. As shown 
by Figure 2, it is not possible to have simultaneously in a rounded version of the 
diagram the convexity of cell of p and a fixed bound on the distance between 
Voronoi vertices and their rounded versions. 
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Fig. 2. In this example, a “good” rounding does not exist 

Rounded computational geometry versus discrete geometry: The use of a grid in 
computational geometry implies some convergence with fields such as discrete 
geometry or computer vision. The main difference between the approaches used 
in these domains is a question of order of magnitude. In an image a grid point is 
a pixel, and the typical size of an image is 1000 x 1000 pixels. In computational 
geometry, the number of grid points depends on the integer arithmetic, a grid of 
16.000.000 X 16.000.000 (for 24 bits integers) is a lower bound. This difference 
of scale has consequences on the algorithm, for example storing the entire grid 
is not possible with that sizes. 

Voronoi diagram definition: 

Definition 1 Let S he a set of n points in the plane, M\, , Mn, which we 

call the sites to avoid confusion with the other points in the plane. To each site 
Mi we attach the region V{Mi) that contains the points closer to Mi than to any 
other point in S: 

V{Mi) = {X G IR2 / 5(X, Mi) < 5{X, Mj) Vj i} 
where 5 denotes the Euclidean distance in IR^. 

The region V{Mi) is the intersection of a finite number of closed half-planes 
(bounded by the perpendicular bisectors of MiMj, j = 1, ... ,n, j i). This 
shows that V{Mi) is a convex polygon, which may be bounded or unbounded. 
The V{Mi)’s and their edges form a cell complex called Voronoi diagram of S, 
whose domain is the whole IR^. 

The Delaunay triangulation of 5 is a complex dual to the Voronoi diagram 
of S, and can be obtained from it by joining the sites whose Voronoi cells are 
adjacent (see Figure 3). 

2 Sufficient condition of correct ronnding 

Given a set S of sites in the plane and a grid step p, the main purpose is to 
know if the Voronoi diagram of S will keep, after snap rounding of its vertices, 
its properties of convexity (each angle of the diagram should stay < tt after 
rounding) and planarity (each angle of the diagram must keep its orientation 
after rounding). We will first focus on a convexity criterion and then adapt the 
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Fig. 3. Duality between Voronoi (continuous lines) and Delaunay (dashed lines) 



results into a planarity criterion. If the criterion condition is not satisfied, we 
want to compute pmax the distance between two consecutive grid points or 
grid size which guaranty the preserving of these properties. The condition must 
depend only on the original data: the coordinates of the sites of S, and the 
topology of the diagram (which is equivalent to the knowledge of the Delaunay 
triangulation); the criterion should not depend on an explicit computation of 
the Voronoi vertices coordinates. 

2.1 Parameterization 

Let A, B, C he 3 Voronoi vertices forming an angle a in A, and A! , B' , C the 
snap rounded corresponding vertices. The angle a' = {A'B' , A'C') has 7 degrees 
of freedom and thus depends on 7 parameters: the coordinates of the points A, 
B, C plus the grid size p for example. In order to make some simplifications, we 
can use an other choice of parameters, a' can be expressed as a function of the 
angle a, the lengths b = AC and c = AB, the orientation 6 and the position 
{xa,Ua) of the triangle {ABC) in the grid, and the grid size p (see Figure 4). 
But in order to know if, for a given grid size p, a given angle will be rounded 
correctly regarding its convexity, we will have to solve the equation a' = tt in the 
variable p. This equation is too intricate and we will make some simplifications 
and over-estimations. 




Fig. 4. a' depends on 7 parameters 
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2.2 Monotonicity of snap rounding: pre-criterion 

Let ABC be an angle, and let divide the plane in four quadrants from the vertex 
A. The monotonicity of snap rounding (i.e. xa < Xb ^ round{xA) < round{xB) 
and Va^Vb^ round{yA) < round{yB)) involves that the extremities B and 
C of the angle stay in the same quadrant when they are rounded (see Figure 
5). Therefore, given an angle whose extremities do not lie strictly in opposite 
quadrants, the corresponding rounded angle is necessarily < tt. Such an angle is 
declared as not risky for the convexity and no further verifications are needed. 
The main criterion for convexity described below will be computed only for 
risky configurations, i.e. whose extremities lie strictly in opposite quadrants of 
plane (see Figure 5). We obtain thus a preselecting criterion that we will call 
pre-criterion in the sequel. 





B 


B 

/ 


A 


B 

/ 


A 


A 




A 





Non risky Voronoi angles Risky Voronoi angles 

Fig. 5. Use of the monotonicity of snap rounding 



2.3 Simplification 

To reduce the number of variables in the expression of the rounded angle a', 
we will suppress the parameters (xA,yA) by considering the worst case, i.e. the 
case that maximizes a' . A simple way to consider the worst case is to assume 
that the three points A, B, C lie at centerpoints of a grid squares and thus have 
a maximal move of half a pixel diagonal. So given 3 points A, B, C centers of 
3 unit-squares the problem is to choose A', B', C on squares vertices so as to 
maximize o' . The general case is not obvious, because this choice depends on 
the values of a. and B. However in a risky configuration, the points B and C are 
in opposite quadrants and the worst position of the points A', B' , C is clearly 
the one shown in Figure 6. 




Fig. 6. Position of A' , B' , C maximizing cf 
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2.4 First criterion: preserving the convexity of angles 

The rounding of a point can be viewed as a translation by some vector u. We 
will now use the fact that with the assumptions of the previous paragraph, 
the vectors of translation of points A, B, C have the same direction and the 
same norm. More precisely, if A is translated by u, B and C are translated 
by —u. But these 3 translations are equivalent — considering the value of the 
rounded angle a' — to translating the point A by 2u with B and C fixed. Since 
II u 11 = V2p, it follows that the maximum grid size Pmax such that a' is convex, 
is given by the length h' j^/2 (see Figure 7). To calculate h' , we will use the 



A 




height h of triangle {ABC) and angle C = 6 + j. Basic relations of triangle 
geometry give: h = c sin(/3) and h' = . After reduction, we obtain: 

C = cot(/3) ■ remains to be expressed cot(/3) as a function of b, 

c and a. Once again, the relations of triangle geometry give, after reductions: 
cot(/3) = • So finally, the maximum grid size warranting convexity of 

the rounded Voronoi diagram is: 



Pmax = 



c /^/2 



cos(C) + sin(C) 



c—b cos(a) 
b sin(a) 



( 1 ) 



2.5 Semi-infinite Voronoi angles 

We call semi-infinite Voronoi angles the angles containing an infinite point, i.e. 
defined by a segment [AB] and a ray [AC). The most natural way to round 
this kind of angles is to snap round normally the segment [AB] in a segment 
[A'B'[, and to transform the ray [AC) in a parallel ray [A'C). This case can be 
treated like the general case, with a few modifications. With the same notations 
as previously, the worst case consists here in translating the point B by —2 u. 
Therefore, we have to express the length h! as a function of c, a and C. It follows 
that the maximum grid size such as the rounded semi-infinite angle remains 
convex is: 

c/V2 

cos(C) — sin((() cot(a) 



Pmax — 



(2) 
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2.6 From Voronoi to Delaunay 

We have thus obtained a criterion expressed on some lengths and angles in the 
Voronoi diagram. We now have to explain how these quantities can be computed 
from the original data, that is from the sites coordinates without an explicit 
computation of the Voronoi vertices. We will assume that the Delaunay triangu- 
lation, which encodes all the combinatorial information of the Voronoi diagram, 
is known. A Voronoi angle a can be deduced from its dual a\ in Delaunay by 
a = 7T — oi (see Figure 3). 



Q 




Fig. 8. How express Voronoi data from Delaunay data 

The only difficulty is to express the edge lengths b and c using only Delaunay 
angles and Delaunay edges lengths. On Figure 8, A, B and C are the vertices of 
a Voronoi angle, the triangles (PQR), (PQS), and (PQT) are the corresponding 
Delaunay faces whose radii of circumscribed circles are respectively ri, r 2 and r^. 

We get b = ri cos(/3i) -I- cos(t 7 i) and c = ri cos( 7 i) -I- r2 cos(5i). For the 
radii of the circumscribed circles, we have the relation (with the usual notations 
in a triangle (ABC)): r = Applied to triangles 

(PQR), (PQS), and (PQT), this gives for h and c: b = ^ (cot(/3i) -I- cot(? 7 i)) 
and c = ^ (cot( 7 i) -I- cot((5i)). At last, the orientation C, = (AB,u) of the 
angle a in the grid is given too by C = {PQ, (—1) !))• Inserting in (1) and (2) 
the preceeding relations we obtain finally a criterion that apply to Delaunay 
triangulation data only. 

2.7 Second criterion: preserving the orientation of angles 

We have seen that an angle close to tt could become non convex after rounding. 
Likewise, the rounding of an angle close to 0 can change its orientation, which 
can cause the overlapping of Voronoi cells and the appearance of non simple 
polygons. To avoid this, we want to know, given S a set of sites in the plane 
and p a grid size, if each angle of the rounded Voronoi diagram of S will keep 
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its orientation, or to obtain a grid size Pmax that guaranties the preserving of 
the orientation. This problem is similar to the problem of convexity, and can be 

solved using the same methods. Finally, we obtain pmax = cos(OH^sin(c/ ^(a) ■ 



3 Use of the criterions 

Poisson Voronoi diagrams: Poisson Voronoi diagrams are diagrams generated 
by a set of sites uniformly distributed in the plane, which is relatively well 
approximated by an uniform distribution in a finite surface. Okabe et al. [11], 
[10] have shown that the probability density function of a randomly selected 
angle a of a typical Voronoi cell in is /(a) = 3^ sin(a) (sin(a) — a cos(a)). 
In particular, it follows from this distribution that p{a < tt/ 2) = which shows 
that the convexity problem is much more frequent than the orientation problem. 

Statistics on the pre- criterions: Experimentally, we found that, for uniform dis- 
tributions of sites, about 1 /3 of the angles are in a risky configuration regarding 
the convexity or the orientation of the rounded angles. This ratio does not de- 
pend on the density of the sites. 

Criterions versus effective computation: One of the main purposes that we want 
to achieve is to have a way to detect the problems that occur when rounding a 
Voronoi diagram without computing the diagram itself. Obviously, this way must 
be cheaper than the computation of the rounded Voronoi diagram and testing 
its planarity and convexity. A careful study of the basic operations needed shows 
that above criterion can be computed in about half the time of the computation 
of the Voronoi diagram and its rounding. 

Results on the criterions: The implementation of the criterions allowed us to 
make some statistics on its efficiency. The criterion is a sufficient but non neces- 
sary condition. Indeed, assuming a maximal move of half a pixel during rounding 
clearly under estimates the grid size needed for a correct rounding. The criteri- 
ons detect as risky some angles that do not create problems when they are snap 
rounded. The figure 9 shows the correspondence between the criterion and the 
actual rounding. 




Fig. 9. Comparison of the criterion with the actual rounding 
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Adaptative grid size: As described in Section 2, the criterions can be used to cer- 
tify that the snap rounding will preserve planarity and convexity or to determine 
a small enough grid size to ensure these properties. The interest of the criterions 
is to give directly a grid size relatively close to the maximal size, and thus avoid to 
test systematically the rounding of the Voronoi diagram with different grid sizes. 

Our experiments (see Figure 1) show the mean value (for 100 experiences) 
of the maximal grid size allowed by the 2 criterions (convexity and planarity), 
as a function of the sites distribution. 

Basic property of the Voronoi Diagram: The main advantage of the criterions is 
to validate the use of snap rounding. Indeed, the rounding of each vertex to the 
nearest grid point is the best way to preserve the tesselation to the closest points. 
It is easy to show than for a proximity query on the snap rounded diagram, the 
difference between the distance to the site given and the distance to the site that 
would have been given by the original diagram is less than where p is the 

grid size. If the point is to obtain the exact nearest site, a possibility is to use 
robust proximity queries methods [9] . 

4 Rounding heuristics 

When the grid size cannot be chosen according to the distribution of the sites, the 
snap rounding can create bad angles. To preserve the fundamental properties of 
the Voronoi diagram, we will have to move some vertices not to the center point 
of the pixel which contains it, but in the center of a pixel in the “neighborhood” 
of the vertex (this notion of neighborhood remains to be defined). The main 
difficulty comes from the adjacency relations between angles. Indeed, moving a 
vertex in order to convexity a given angle, can generate non convex angles in 
the neighborhood of this angle (each Voronoi vertex belongs to 9 angles of the 
diagram) . 

We define briefly a rounding heuristic in the following way: 

— the choice of a set of points candidates for the rounding of a given vertex, 

— the choice of a cost function which gives the best candidate (in a way) of the 
set. 

Definition 2 We call kernel polygon associated to a Voronoi vertex v the convex 
polygon obtained as the intersection of (see Figure 10): 

— The triangle joining the extremities of the angles containing v as apex, 

— The 3 wedges defined by the pair of edges incident to the 3 neighbors of v 
and to V. 

As shown in Figure 10, the kernel polygon of a vertex v is the set of points 
where v can be moved preserving the convexity and the positive orientation of 
any angle involving v, when the others vertices are fixed. A diagram has the 
properties of convexity and planarity if and only if any vertex belongs to its 
kernel polygon. 
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Fig. 10. Kernel polygon of a Voronoi vertex v (Voronoi diagram in continuous 
line) 

If the snap rounding of the Voronoi diagram of S is not convex and planar, 
the following heuristic can be used: 

Algorithm Rounded Voronoi(S) 

1 . W ^ Voronoi diagram of S 

2. 7ZVD <— Snap rounding(VH) 

3. while TZW not planar and convex 

4. do 

5. u <— A vertex of TZVT> involved in a bad angle 

6. K{v) ^kernel of v in TZW 

7. u' <— A grid point inside K{v) (if it exists) 

8. Modify TZW by moving v to v' 

9. return TZVV 

At this point, this algorithm is not yet completely clear, we need to know how 
V is chosen among the bad vertices and how v' is chosen among the grid points 
inside K(v). Furthermore, in case of failure, the algorithm, as presented above, 
fail in an infinite loop instead of ending with a clear status. 

First implementation: A first simple approach to choose v is to use all bad angles 
in turn. The apex of the angle is tried first, then if its kernel contains no grid 
points the extremities of the angle are tried. The problem of the choice of v' can 
be solved easily by taking v' the grid point of K{v) nearest to v. If for a given 
angle, none of the three vertices involved can be moved inside its kernel, then 
we end with a failure status. A rounding preserving convexity and planarity has 
not been found. 

Such an approach works well in practice. If the density of points is not too 
high, then the bad angles are sparse in the diagram and the order used to examine 
them has no influence. Very often K(v) contains grid points, more precisely, quite 
often at least one of the vertices of the triangle used in the kernel definition, say 
w, belongs to K{v); rounding v to w has the effect of contracting the edge vw 
and increases the degree of the resulting vertex. 

This algorithm has been implemented and gives very good results since in 
the worst case, it corrects all the bad angles but less than 0.05% of them. The 



Rounding Voronoi Diagram 385 



first two lines of table 1 correspond to data coming from a GIS database (raw 
data in line 1, dilated data on a scale of 24 bits integers in line 2), while the 
third line corresponds to an uniform distribution of points. 



Number 

of 

points 


Side 
of the 
square 


Total 
number 
of angles 


Risky 
angles for 
pre-criterions 


Risky 
angles for 
criterions 


Maximum 
grid size 


Bad angles 
for actual 
rounding 


Bad angles 
after 
heuristic 


120973 


58093 


725727 


304901/62992 


76564/859 


very small 


8669/38 


14/0 


120973 


11618600 


725727 


304901/62992 


461/4 


0.0011 


52/0 


0/0 


1000000 


100000 


5999598 


2190626/190760 


175988/620 


very small 


14664/14 


4/0 



Table 1. Excerpt from the table of results (columns of risky or bad angles give: 
number of angles for convexity / for planarity) 

Generalizations 

Edge contraction: As noticed above, the algorithm still has some freedom to 
choose V and v' . We first can remark that the problems usually come from an 
initial angle which is close to tt (convexity) or to 0 (orientation), or from an initial 
edge whose length is small. Problems created by small edges can be solved by pro- 
moting vertex merging, if K{v) contains w, a neighbor of v close to v, then we can 
choose w as new position for v. This choice results in the contraction of edge vw. 

Vertex rounding propagation: Since the difficulty may come from the propagation 
of the bad angles, it would be interesting to try to direct this propagation along 
a centrifugal axis, i.e. in direction of the convex hull of the initial sites. Indeed, 
the semi-infinite Voronoi angles are less constraining than the finite ones since 
they contain an infinite point. In particular, the “kernel polygon” for an infinite 
vertex is an angular sector, therefore it contains an infinite number of grid points 
(see Figure 11). 




Fig. 11. Kernel polygon of an infinite Voronoi vertex 



5 Conclusion 

We have presented several results about the rounding of a Voronoi diagram 
preserving its planarity and the convexity of its cells. The first idea consists 
in snap rounding (rounding to the nearest grid point) all the vertices of the 
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diagram. We have established a reasonably cheap condition which ensures that 
the snap rounding preserves these properties. We have studied experimentally 
the efficiency of the snap rounding and the tightness of our condition on random 
Voronoi diagrams. 

We have proposed an heuristic algorithm, which may round the Voronoi 
vertices further away in the grid, while preserving the desired properties. This 
algorithm works very well in practice as it is shown by our experimental results. 

We have proven that, in the worst case, it is impossible to preserve planarity 
and convexity and to guarantee a fixed bound on the distance between a vertex 
and its rounded version. This paper proves that, with reasonable hypotheses on 
the data distribution, snap rounding and some heuristic will actually succeed to 
round the Voronoi diagram with good probability. 

Another, more theoretical, direction of research consists in searching a de- 
terministic algorithm of rounding preserving convexity and planarity such that 
something is provable on the distance between a Voronoi vertex and its rounded 
correspondent. Since a constant bound is not achievable, a bound depending on 
the distance to the nearest site would be a good result. 
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Abstract. Existing algorithms for rendering Bezier curves and surfaces fall into 
two categories: iterative evaluation of the parametric equations (generally using 
forward differencing techniques) or recursive subdivision. In the latter case, all 
the algorithms rely on an arbitrary precision constant (tolerance) whose appropri- 
ate choice is not clear and not linked to the geometry of the image grid. In this 
paper we show that discrete geometry can be used to improve the subdivision al- 
gorithm so as to avoid the need for any arbitrary value. The proposed approach 
extends well and we present its application in the case of 2D and 3D Bezier curves 
as well as Bezier triangle patches and tensor-product surface patches. 

Keywords. Bezier curves, Bezier surfaces, De Casteljau subdivision, polygonal- 
ization 



1 Introduction 

Bezier curves and patches are among the most fundamental primitives in computer 
graphics and computer aided modeling. However, as they are defined by means of 
mathematical equations, they are continuous objects which are not ideally suited for a 
computer representation. Discrete geometry aims at providing an equivalent of these 
mathematical objects in the same way as it has established formal definitions of digital 
lines and planes [9]. A first step towards this goal consists in developing a discretization 
algorithm of continuous Bezier curves and patches which would be consistent with ex- 
isting results in discrete geometry. In this paper, we propose such an algorithm. Our ap- 
proach is based on a classical De Casteljau recursive subdivision algorithm but with a 
new flatness criterion based on the digital geometry of lines and planes which guaran- 
tees a recursion depth close to optimal and an appropriate connectivity of the obtained 
discrete curve or surface with no need for arbitrary constants. Moreover, though we re- 
strict our presentation here to the case of cubic Bezier curves and patches for the sake 
of clarity, it is a remarkable fact that the approach is general enough to be easily extend- 
ed to higher degrees and dimensions. 
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2 Discrete lines and planes 



It has been shown that beyond an algorithmie deseription, digital lines and planes ean 
be given rigorous mathematieal formulations. Thus a planar digital line is a subset of 

' ^ , denoted D(n, y , p ) deseribed by a diophantine equation of the following form : 



y>P): + <Y+ p (1) 

2 

where (n^, n^)G ' defines the direetion of the line, y & ' defines its affine offset 
while p G ' is ealled arithmetieal thiekness. A partieularly interesting subset of digital 
lines eonsists of those verifying p = |n^| ), whieh are simply denoted 

Z)(n,Y): 



Z)(n, Y ): Y ^ + n^y < y + max(|nj, |n^| ) 



( 2 ) 



whieh are ealled naive digital lines and have exaetly the same strueture as the sets drawn 
by the elassieal Bresenham algorithm for lines [1]. The most important property of na- 
ive lines is striet 8-eonneetivity. 

Z)(n, y ) is the digitization by tmneation of the ordinary euelidean line of equation 



n^x + n^y = y while y - 



max( 




> 




)■ 




2 







is its digitization by the elosest inte- 



ger point. 

When eonsidered in ? } instead of ' ^ , Equation 2 defines a eontinuous strip of 

max{\n\, In I ) 

width (measured orthogonally to the direetion of the strip) w ~ i i i , 






whieh 



ean be thought of as the eontinuous eounterpart of the naive digital line. Sueh a width 
w eorresponds to a thiekness of 1 in the direetion perpendieular to the main line diree- 
tion. 



Digital planes are defined by a similar diophantine equation and thus share similar 
properties. We eall digital plane the subset of ' ^ denoted P(n, y ,p) that verifies the 
following equation: 

y,p) y<n^x + nyy + n^z<j+ p . (3) 

In the same way as naive digital lines, digital planes verifying 
p = max(|n_j.|, \n^ ,|n^| ) are ealled naive digital planes and are simply denoted 

P(n, y ). Naive digital planes are 18-eonneeted sets with no holes for 6-eonneetivity. 
One of the most fundamental properties of digital planes is their being flinetional along 
the main direetion of their normal, i.e., if = max(|M_^.|, |n^| ,|n^| ) then for eaeh (x, y ) 

there exists a single z sueh that (x, y ,z) belongs to the digital plane. 
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In addition, P(n, y ) is the digitization by truncation of the ordinary euclidean plane 
of equation n^x + n^y + n^z = y . 

Similar to euclidean lines in ? which are commonly defined as the intersection of 
two euclidean planes, three-dimensional digital lines can be conveniently represented 
by the intersection of two particular digital planes though such a definition is not ideal 

3 

from a theoretical point of view [5]. Thus a digital line of direction n e ' with 

3 

|n^| ,|n^| ), denoted D(n, y ,v) is the set of points (x, y ,z ) of ' ver- 
ifying: 



D(n,y ,v): 



y < - n^x + n^z <y + n^ 

V < - n^y + n^z < v + 



(4) 



When considered in ? instead of ' ^ , Equation 4 defines a generalized continu- 
ous cylinder having for intersection with the plane xOy (main plane orthogonal to the 
axis Oz along which the direction of the line has its biggest coordinate) a unit square. 
Thus a digital 3D line can be seen as the set of integer points contained in such a cylin- 
der. 



3 Digitization of cubic Bezier curves 

There exists essentially two different approaches to the scan-conversion of Bezier 
curves and surface patches [ 6 ]. The first one uses the parametric representation of the 
curve and evaluates repetitively the equations using forward differencing. Forward dif- 
ferencing is a fast and efficient technique which can be hardware accelerated. However 
it suffers from two major drawbacks: first, it is a floating point algorithm subject to nu- 
merical drifts due to error accumulation [ 2 ] and whose implementation requires great 
care and a register width depending on the number of pixels to draw, furthermore there 
is naturally no linear relation between the parameter and the coordinates of the drawn 
points. Hence a regular subdivision of the parameter interval, though simple, is partic- 
ularly inefficient since it can lead to many unneeded evaluations, drawing the same dis- 
crete point (if the parameter interval is too small) or holes in the curve, respectively 
patch, (if the parameter interval is too big). Therefore refinements such as dynamic step 
size adjustment are preferred [2, 4, 10]. But even in that case there is still a need for 
choosing a parameter increment and criterions for deciding when to scale that 
increment [4]. Unfortunately all those criterions are based on some geometrical value 
(surface of a triangle, distance between a point and a line, angle) being “small” and thus 
require determining a tolerance constant which, in practice is often chosen arbitrarily 
and whose relation to the sampling grid is not clear. 

The second approach uses the De Casteljau algorithm [3], a stable and efficient 
method with intrinsic adaptiveness to the curve. This method consists in recursively 

subdividing the control polygon (Bq, P j ,^ 2 , ^" 3 ) into two sub-polygons 




Digitization of Bezier Curves and Patches 379 



(Pq, Pq ,Pq, Pq ) and (Pq, Pj ,p\, P3 ) (Figure 1 ) where the P'j are defined as weighted 
sums of polygon vertices: 

P"^^ = aP" + (l-a)P”+i ae[0, 1], (5) 

At each step the area of a new control polygon is smaller and hence is closer to the 
Bezier arc which remains invariant along the process. The recursion can be stopped 
when the control polygon is close enough to the curve. 



P 

Figure 1: De Casteljau Subdivision 

Theorems exist that indicate when the maximum distance between the arc and the 
control polygon is smaller than s , based on the geometry of the sub-polygons [8, 1 1] 
or directly based on the initial control polygon and the recursion depth [ 7 ]. All of these 
results rely on the choice of an ad hoc constant s which makes them quite unsatisfying 
from a theoretical point of view. In what follows we show that we can eliminate the 
need for such a constant. 

We denote with [r ] the integer part of r e ? i.e., the greatest integer smaller than 

r. Similarly we denote with [R ] the integer point of ' ^ (resp. ' ^ ) whose coordinates 
are the respective integer parts of the coordinates of P g ? } (resp. ? } ). Let us con- 
sider an arc of integer endpoints (P, Q ). We call axis of the arc, the line defined by its 
two endpoints. The vector PQ is the direction of the arc. We also call width of the arc, 
the diameter of the smallest cylinder of axis (P, Q ) that encompasses the whole arc. 
And similarly we call width of a set of points of ? „^ , £ = ^ ^0 < ; < n respect to 

the direction (n^, ) g ' ^ , the diameter of the narrowest cylinder of axis (n^, ) that 

encompasses E . This width w. „ T£ ) is given by: 




dia{{nP.+nP.} 



y iy>o<i<n- 



hx + 



where dia(A ) for a subset A of ?„ is defined as: 



dia{A ) = max^ ^ ^(a )- min^ ^ ^(a ) . 



( 7 ) 
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Theorem 1. Let C be a planar arc of integer endpoints P, Q and of width w. If 
max(|PQ I, IPQ I ) 

w < ||pQ|| ^ then the best 8-connected integer approximation ofC is a na- 

ive digital straight line segment of direction PQ. 

Proof Equation 2 defines a naive digital line as the set of integer points contained in 
a continuous strip of the euclidean plane. The width w of this euclidean strip relates to 

the arithmetic thickness of the digital line p through the relation w = 2 It be- 

mav(|PQ I, IPQ I ) 

comes clear then, that if C verifies w < — then it fits within the real 

boundaries of a naive digital line and hence there cannot be a better integer approxima- 
tion to C than a naive digital straight line segment, n 



Theorem 2. Let B be a planar cubic Bezier arc defined by its control polygon 
(Pq, Pj ,P 2 , P 3 ), B can be optimally represented by a naive digital straight line seg- 
ment of direction [PqP^] iff 



dia{{n^Pi^ + nyPiy)^^i^ 3 |«y| ) 

where n^ = [P ^y]~ [P and n^ = 



( 8 ) 



Oy 



Proof Wang’s theorem states that B lies at most -max(d{P^ ), d{P 2 )) away from its 
axis where d(Pj ) denotes the distance of point P- to the axis and that this bound is op- 
timal [11]. Furthermore the convex hull containment property of Bezier curves states 
that B lies entirely within its control polygon [3]. Therefore the width of B is at most 
3/4 of the width of its control polygon with respect to the line ([Pq 1 [P 3 ]), the width 



of the control polygon being given by 
follows directly from Theorem 1 . n 









n? + 



The result then 



A control polygon verifying Theorem 2 is said to be flat and the recursive subdivi- 
sion can be stopped at that level. If the control polygon does not meet the criterion of 
Theorem 2 then it is subdivided into two sub-polygons according to (5). Equation 5 
leaves a degree of liberty in the choice of a . Optimizing this value at each recursion 
step yields a digitization with the minimum number of discrete segments. Such optimi- 
zation however is beyond the scope of this paper and in practice a = ^ is the usual 
choice. 
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Theorem 2 does not provide the complete equation of the digital line segment that 
is the best approximation of the spline arc but only its direction. The affine offset y 
must also be determined. The ideal value of y which yields a discretization by the clos- 
est integer corresponds to the midline (axis) of the narrowest cylinder of direction 
([^0 1 [^3 ]) enclosing the curve within its control polygon. In order to find an alge- 
braic formulation of y , we must distinguish two cases depending on whether the curve 
crosses the line ([T’q \ [P3 ]) or not. 

1. The curve does not cross ([Pq ], [P3 ]). In this case the curve lies between the line 
([Pq 1 [^3 ]) nnd a parallel at three fourths of the distance of the most distant point of 
the control polygon to that line (this may be Pj or Pj ). Assuming that point to be Pj , 
y becomes: 



3 + «/oy ) 




max( 




> 




) 


8 




2 



2. The curve crosses ([Pq \ [P3 ]). In this case the curve is enclosed in a cylinder of 
direction [P0P3] whose axis is midway between the points and P2 ■ Hence y be- 
comes: 



ririxPix + )+ i^xPlx + ^yPly >1 




-max{ 










L 2 J 




L 2 J 



( 10 ) 




Figure 2: Discretization of a Bezier arc 

A Bezier arc described by two flat control polygon (Pq, Pj ,P2, P 3 ) and 
(P 3 , P 4 ,Pj, Pg ). The round dots are the integer points making up the 
discretization of the Bezier arc. The dash-dotted lines represent the axes 
defined by Equation 9 while the simple dashed lines represent the real 
boundaries of the digital line segments as defined by Equation 2. 

A criterion for digitizing 3 D Bezier curves stems from the same principle as for 2 D 
Bezier curves: the De Casteljau recursion stops and the curve segment can be rendered 
as a discrete line with no loss of precision when the convex hull of its control polygon 
is bounded by the limits of a 3 D digital line as defined in ( 4 ). Denoting 
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= [^3j- ]“ ]’ ^ t^3z ]“ [*^0 J ^nd assuming 

|n^| = max{\n^^, |n^| ,|n^| ), the condition writes: 






( 11 ) 



The affine offset proposed in Equations 9 and 10 still holds, with the same restric- 
tions, in this case by considering independently the projections of the control polygon 
on the main planes xOz and yOz. 



4 Digitization of 3D cubic Bezier surface patches 

We call naive digital plane patch a non-empty 26-connected subset of a naive digital 
plane T’(n,y) defined as a non-empty polygon on that plane by the following set of equa- 
tions: 



e(n,Y,(q„ h)o<k<my- 



F(n.,)n “u=‘*‘hyy*‘hZ<>X ,12) 

?(m-l )r« + ^{m-\)yy + ?(m-l < ^3 



3 

where q,;- e ' and X f, g ' for 0 <k<m . 

Special interesting cases of naive digital plane patches include digital quadrilaterals 
(m = 4 ) and triangles (m = 3 ). 

3D bicubic tensor-product Bezier patches as well as Bezier triangles can be approx- 
imated by naive digital plane patches as defined by Equation 12. Indeed the De Castel- 
jau recursive construction is general and still applies in those cases. 



Theorem 3 . Let B be a bicubic Bezier patch defined by its control net {P^ j) ^ ■ Xet 

u ^ [1*0, 0 IP3. 0 = [Pq, 0 IP3. 3 ] n = u X v //■ 



+ + j<mav(|nJ, \n^\ ,|nj ) (13) 

then the best 26-connected integer approximation ofB is a digital plane patch of normal 
direction n. 
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Theorem 4. Let B be a Bezier triangular patch of degree d defined by its control net 
= [P 003 IP 300 ]’ V = [P003 lP030]««^n ^ uxy if 

i+j+k=d 



dia 



{ '^xiPijk X + n^iPiik X + fiyiPnk X } 



I 



yD ijkJy ' ijk^ 



0<ij,k<d <max(\r 
i+j + k = d j ' 



) (14) 



then the best 26-connected integer approximation ofB is a digital plane triangle of nor- 
mal direction n. 



Proof Since the containment property of Bezier patches and triangles within their 
control net holds, Equations 13 and 14 being verified amounts to the whole surface B 
lying between the limits defined by Equation 3 for a naive digital plane of direction n. 
n 

The value of the affine offset y of the digital plane that represents the digitization of 
B by the closest integer point is given by: 



y = 



-min{n^{Pi j ^ + n^(P. • + n^(P. • ^ ^ 



max{n^(Pi j \ + n^(P. • ^ ^ 

j_ 



(15) 



- 


-max{ 


N 


5 




5 


N 


>1 















5 Connectivity issues 

5.1 Connectivity of digitized Bezier curves 

The construction proposed previously for discretizing a 2D Bezier arc actually builds a 
continuous polygonal line (the dash-dotted line of Figure 2) whose discrete counterpart 
is the discretization of the Bezier curve. Thus the overall discretization forms an 8-con- 
nected path which is the most reasonable requirement for the discretization of a perfect- 
ly general Bezier arc. In simple cases, if the Bezier arc does not have too important 
changes in orientation, its overall discretization forms a 8-connected simple curve, i.e., 
every point of the discretization has exactly two 8-neighbors except for the endpoints 
(if the curve is not closed). 

In the case of 3D Bezier curves the proposed construction does not create a contin- 
uous 3D polygonal 3D line since the midlines of consecutive segments may not be co- 
planar. However it can be shown that the overall discretization is still 26-connected. 

Theorem 5. Let Dj and D 2 be two digital 3D lines as defined in Equation 4 and let 
and H 2 be their respective real square-based enclosing cylinders. If PI H 2 ^0 
then D[ and D 2 intersect or have two 26-adjacent points 
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Proof. If //j n H 2 7^0 then that intersection contains at least a real point 
Pq(xq, Fq ,Zq ). Since Pq is contained in the enclosing cylinder of contains 

an integer point ,Zy ) such that 

1^1 “-^o| ^ ^ 

\Y^-y^\<l (16) 

1^1 “^o| < ^ 

A similar statement holds for D 2 which contains an integer point 12 (^ 2 ’ ^2 ’-^2 ) 
such that 



1^2 "^qI ^ ^ 

|F2-yo|<l (17) 

1^2 “ ^o| < 1 

Hence /j and l 2 are at least 26-adjacent if not equal which proves the theorem, n 

Each segment of 3D digital line making up the discretization of a 3D Bezier curve 
is the set of points contained in a truncated square-based cylinder (as mentioned in para- 
graph 2). Consecutive cylinders always have a non-void intersection since they contain 
at least one common real point (the point that is common to the two consecutive Bezier 
control polygons which is also part of the Bezier arc itself). Thus, thanks to Theorem 5, 
26-adjacency between consecutive discrete straight line segments is guaranteed. 

5.2 Connectivity of digitized Bezier surface patches 

The connectivity problem becomes more intricate in the case of the digitization of Bez- 
ier surface patches where discrete connectivity has to be controlled along the whole 
length of the edges of patches. This involves determining the appropriate bounds of the 
equations defining the discrete plane patch in Equation 12. We examine the problem in 
the case of bicubic tensor product Bezier patches though the results still hold in the case 
of Bezier triangles. The problem is still mostly open in this case and we only provide 
directions towards formal proofs. 

Let B and B' be two adjacent Bezier bicubic tensor-product patches verifying 

Theorem 3 and having for respective control nets (P- ■ ) and {P'l : ) 

0<j<3 ’ 0<j<3 

such that P 3 j = P'q ,• for 0 < i < 3 (Figure 3). 

Let Q(n, y ,{q^, h\<k< 3 ^ Y' Xl'k, ’^k\<k< 3 ^ 

plane patches and let us denote H and H' their real enclosing parallelepipeds defined 
in Equation 3. We consider two different cases depending upon whether n and n' have 
the same main direction. 

1. n and n' have the same main direction. Let us assume that this direction is Oz. 

This condition writes n = max(\n \, In I ,|n I ) and n' = max(\n' In' I ,|n' I ). In 
z ^1 -^1 I aI I ^ -^11 aI I 
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Figure 3: Adjacent bicubic Bezier control nets 

that case Q and Q ’ are adjacent and their union is a 26-connected discrete surface iff 
(2 u 2 ’ still has the property of functionality of the individual digital plane patches, 
which amounts to say that Hvj H" has everywhere a thickness of 1 in the direction Oz 
and one full side of H and H" is contained in their intersection H (see Figure 4). 




Figure 4: Union of the convex hulls of two discrete plane patches 

A problem arises in the case of tensor-product surfaces which may not be planar 
even though they verify Theorem 3. This results in the choice of inconsistent normal 
values between adjacent patches that make it difficult to meet the previous connectivity 
criterion. Such a problem does not occur with triangles, since triangular patches are al- 
ways planar. Hence, one solution is to divide the tensor-product patches (which are 
quadrilaterals) into two triangles and work only with discrete triangles. 

2. n and n’ have different main directions. In the case where n and n’ have different 
main directions the previous condition does not hold anymore and we can only guaran- 
tee that Q'^ Q’ is 26-connected if // n H ’ contains at least one of the sides ofHor H’ 
and Hvj H" has everywhere a thickness of at least 1 with respect to the main directions 
of n or n ’ . 



6 Conclusion 

In this paper we have presented a method to polygonalize Bezier curves and surfaces 
into discrete lines and plane patches. Unlike existing rendering algorithms that all re- 
quire an arbitrary tolerance constant s , our approach is entirely based on the geometry 
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of the manipulated objects and theorems of discrete geometry. Moreover our termina- 
tion criterion for the subdivision of Bezier curves is optimal. The criterion, though pre- 
sented here in the case of cubic curves and bicubic surfaces, is general and can be 
extended to Bezier curves and surfaces of higher degree. Appropriate connectivity of 
Bezier curves polygonalized by our method is ensured both in 2D and 3D and we also 
provide a criterion to ensure appropriate connectivity of adjacent discrete surface patch- 
es in restricted cases. Further work is needed to determine a more general solution to 
this connectivity problem of 3D patches. 
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Abstract. We study the problem of discretization in a Hausdorff space 
followed in [WTR 98]. We recall the definitions and properties of the 
Hausdorff discretization of a compact set. We also study the relationship 
between the covering discretizations and the Hausdorff discretization. 
For a cellular metric every covering discretization minimizes the Haus- 
dorff distance, and conversely, if the supercover discretization minimizes 
the Hausdorff distance then the metric is cellular. The supercover dis- 
cretization is the Hausdorff discretization iff the metric is proportional 
to doo ■ We compare also the Hausdorff discretization and the Bresenham 
discretization [Bres 65] . Actually, the Bresenham discretization of a seg- 
ment of IR^ is not always a good discretization relatively to a Hausdorff 
metric. 

Keywords: Bresenham discretization, cellular metric, Hausdorff metric, 
lattice, covering discretization, Hausdorff discretization. 



1 Introduction 

Let K he a compact subset of fR", we investigate the following problem : 
determine the set Jv[h^{K) of subsets of minimizing a Hausdorff distance to 
K. We have shown [Wag 97] that MnaiK) is stable by union and that the union 
of all its elements is also a set minimizing the Hausdorff distance. This union is 
called the Hausdorff discretization of the compact K. We also proved that the 
“Digital geometry” converges in Hausdorff space to the “Euclidean geometry” 
as in [DT 95]. 

The supercover discretization operator Asc is defined by VAT C fR", Asc{K) = 
{p G I AT n C(p) yf 0} where for each p, C{p) is the cell corresponding to 
p. We also study the relationship between the covering discretizations and the 
Hausdorff discretization. We proved that, for a cellular metric every covering dis- 
cretization minimizes the Hausdorff distance, and conversely, if the supercover 
discretization minimizes the Hausdorff distance then the metric is cellular. We 
proved also that the supercover discretization is the Hausdorff discretization iff 
the metric is proportional to doo-We compare also the Hausdorff discretization 
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and the Bresenham discretization [Bres 65] . Actually, the Bresenham discretiza- 
tion of a segment of is not always a good discretization relatively to a Haus- 
dorff metric. 

This paper is divided in four sections. In the second section we present some clas- 
sical notions of metric space and Hausdorff space, and we introduce some new 
metric notions. In the third section, we study the problem of discretization in a 
Hausdorff space and we introduce a new discretization : Hausdorff discretization. 
In the last section, we present some applications. Actually, we prove that the 
Bresenham discretization of a segment of is not always a good discretization 
relatively to a Hausdorff metric. 

The proofs are not given here, they will appear in future papers in preparation 
[RT 98]. Some of them can be found in [Wag 97]. 



2 Metric space and Hausdorff space 

This section contains some classical notions of metric space and Hausdorff space, 
it also contains some new metric notions. The proofs of the classical notions used 
in this section can be found (for example) in [HY 88], [GK 90], [Bar 93]. 

Definition 1. Let (£, d) he a metric space and let p & £ and r G , 

Bf{p) = {x & £ \ d{x, p) < r}. 

Definition 2. Let {£, d) be a metric space and let E C £, the interior of E is 
the set int{E) = {x & E \ 3r>0, Bf{p) C E}. 

2.1 Metric space and normed vector space 

Definition 3. Let {£, d) he a metric space and let K C £; K is said to be 
compact if every infinite sequence in K contains a subsequence having a limit in 
K. 

Definition 4. Let {£, d) be a metric space and let A C £ and xq G £; d{xo, 
A) = infy(zA{d{xo, y)). 

Definition 5. A metric d on iR" is said to be invariant under translation if 

y{x, y, z) G (IR”)^, d{x + z, y + z) = d{x, y). 

Definition 6. A norm over a vector space £ is an application N : £ — > 
such that : 

-Vx G £, N{x) = 0 a; = 0. 

-WxG£,y \g m, N{\x) = |A|iV(a;). 

- V(x, y) G £^, N{x + y) < N{x) + N{y). 

{£, N) is called a normed vector space. 
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Remarks 

• If iV is a norm over E, then the function such that : Vcc, y € £, 

y) = N{x — y) is & metric over £. d^ is called the metric induced by the norm 
N. 

• A metric induced by a norm is invariant under translation. 

Examples : 

£ = iR” and let x = (xi, X 2 , ■ ■ ■ , Xn) G IR^. 

Vp > 1, ||a;|| = ^|a;i|P + --- + |a;„|P and ||a;||^ = \xi\ = lim ||x|| 

^ l< 2 <n p—^oo ^ 

are a norms over ]R^. The metrics induced by these norms are dp and doo re- 
spectively. 

Property 1. [Sto 70] 

• Let be a norm on K", and let = {x G IR"; N{x) < 1}. Then is a 
compact convex set with a non-empty interior and symmetrical relatively to the 
origin. 

• Conversely, for every compact convex set K C IR" of dimension n, which is 
symmetrical relatively to the origin, there is precisely one norm N such that 
Bat = K. Moreover Va; G iR", N{x) = inf{{r G ^x G K}) 

2.2 Cellular metric 

In this subsection we introduce a new notion : a cellular metric. We present some 
properties of a cellular metric and we compare it with the metrics di and doo- 

Definition 7. Let p G we define the the cell of center p as 

C{p) = {x G £\doo{x, p) < 

Definition 8. A metric d over IR" is called cellular z/Vx G IR”, Vp, q G 
X G C{p) d{p, x) < d{q, x). In particular, if x G C{p) C\C{q), then d{p, 
x) = d{q, x). 

Definition 9. In IR", a symmetry relatively to the hyperplane Xi = Q is the 

function (7i : IR" ^ IR" which associates to the vector {x\, ■■■, x„) the vector 
{x\ , ■ ■ •, Xn') where xf = —Xi and x/ = Xj for all j yf i. 

Proposition 1. Let N he a norm invariant under (7 • • •, <T„, then the metric 
dN induced by N is cellular. 

Proposition 2. If d is a cellular metric in IPf and d is induced by a norm N, 
then N is invariant under (7 1 and 72- 

Definition 10. Let d be a metric on IB" invariant by translation. 

• The cellular covering radius of the metric d is rcc{d) = m/({r>0 , C{0) C 
B'^{0)}) where O is the origin. 

• The covering radius of the metric d is rdd) = inf{{r>0 \ IR" = ^r{p)}) 
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Remarks 

• rc{d) < rcc{d). 

Proposition 3. Let d be a metric invariant by translation. If d is cellular then 
rc{d) = rcc{d). 

Proposition 4. If d is a metric induced by a norm N invariant under <Ji, ■ ■ ■, 
(J„, then rc{d) = N{\, \, ■ ■ ■, \) = \ N{1, 1, ■■ ■, 1). 

1 

Examples : In K", Vp > 1, rddp) = (^), and rddoo) = 

Corollary 1. If d is a metric induced by a norm N and d is invariant under 
ai, ■■■, a„, then (O) C 

We introduce now some restrictions in order to compare the metric d to the 
metric di. 

Let be a group of isometries of iR" (for the euclidean norm) preserving the 
canonical basis {ci, • • •, €„} : the elements of ff are the linear applications whose 
the matrix is a permutation matrix (with coefficients in {0, 1}, and exactly one 
“1” by line and exactly one “1” by column). 

Let G{N) be the group of isometries in for which N is invariant : 

g(N) = {d e J \ yx e N{g{x)) = iV(a;)} 

Proposition 5. Let N be a norm on K" invariant under (Ti, ■ ■ ■, <T„. If G{N) 
is transitive on the basis {e\, • ■ ■, Cn}, (i.e. Vi, j G {1, • • •, n}, B-ip € G{N), 
'4’di) = Cj ), then 

C C Bf{0) 

Remarks : With the same hypothesis as in Proposition 5, we have : 

Let p, q G 

" If di{p, q)>n, then n = 0- 

- If di{p, q) = n, then Bl)^(^^-^{p)nB‘(:^f^^^{q)nC{p) = C{p)nC{q), which is nonempty 
iff p and q are diagonally adjacents except for d =cydi. 



2.3 Hausdorff metric 

The definitions and results presenting in this subsection can be found in [GK 
90], [Bar 93]. 

Definition 11. Let {S, d) be a metric space, H{S) is the set of the nonempty 
compact subsets of S. 

On 'H{E), we will define a metric Hd, such that if {£, d) is a complete metric 
space then {Ti.{£), Hd) is a complete metric space. 
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Definition 12. Let {£, d) he a metric space. Let A, B G 'H{£). We define the 
oriented Hausdorff metric from a set A e 'H{£) to a set B G by : 

hd{A, B) = maxaeA(minbgB(rf(a, b))) = ma,^aeA{d{a, B)) 

Definition 13. Let {£, d) be a metric space. The Hausdorff distance between two 
compact sets A, B G TL{£) is defined by Hd{A, B) = max{hd{A, B), hd{B, A)) 



Definition 14. Let A, B be subsets of ; the Minkowski addition of A and B 
is A(BB = {a + b\ aG A, b G B} = IJaeA = Uhcs where Vf G fR", 
A(t) = A0 {t} is the translation of A by t. 

Property 2. Let d be a metric on IR" and let A, B G 

Hd{A, B) = min{{r > 0 | A C l^f{b) and B C B'}{a)}). So, if the 
metric d is invariant under translation, then 

VA, B G nilT), Hd{A, B) = min({r > 0 | zl C B 0 Bf{0) and B C 
A®Bf{0)}). 

Property 3. [GK 90] , [Bar 93] 

Let {£, d) be a metric space such that d is invariant by translation : 

• V(A B) G n{£fi, Vr > 0, hd{A, B)<r^ACB® Bf{0). 

• W{A, B, C, D) G Hd{A\JB, CUD) < ma.x{Hd{A, C), Hd{B, D)). 

• {£, d) is a complete metric space 4=^ {TL{£), Hd) is a complete metric space. 

3 Hausdorff Discretization 

We recall here the results presented in [WTR 97]. 

In all the following we assume that we have as metric space (iR”, d), where d is 
a metric induced by a norm on IR” , and as a discrete space . 

3.1 Morphological discretization 

In this subsection we introduce the morphological discretization studied in [Hei 
91a], [Hei 91b] and [Hei 92]. 



Definition 15. Let K C iR" and S C iR", the discretization by dilation of K by 

S is the set = {K 0 5) n where S = {— s | s G 5}. S is called the 

structuring element. 

Property 4- Let K C iR" and let S C iR", then = {p G Z" \ KnS{p) yf 

01 - 

Remarks, definitions and notations: 

Let K C iR", c? be a metric induced by a norm, and let S C iR" : 

• Let r G iR'*’. For S = Bf:{0), is called the discretization of K of radius 

r for the metric d and it is denoted in the following by 
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• For S = {0}, is the discretization, by sampling, of K. 

• For r = rcc{d), 2\f(K) is called the discretization, by overlapping, of K. 

• For d = doo, = {p g Z"' \ C{p) n iC ^ 0} is the supercover 

discretization of K [CK 95] which is denoted in the following by /\sc{K)- 

• Let S C if Vp G S', iC n C{p) yf 0 and K C C{p) then S is called a 
covering discretization of K [And 98]. So, the popular supercover discretization 
is the maximal covering discretization. 

3.2 Characterization of Hausdorff Discretization 

In this subsection we characterize the set A4h^{K) of subset of minimizing 
the Hausdorff distance to a given compact K, we have proved [Wag 97] that 
AiHa(K) is stable by union and that the union of all its elements is also a set 
minimizing the Hausdorff distance. 

Definition 16. Let K G 7t(iR”). A set M C is a Hausdorff discretizing set 

ofK if Hd{K, M) = inf{{Hd{K, ^) | 5 C ^"}). 

• The value th{K, d) = inf{{Hd{K, S') | S' C ^”}) is called the Hausdorff 
radius of the compact set K for the metric d, 

• MhAK) = {M C Hd{K, M) = inf{{Hd{K, S) | S C ^”})} is the set 
of subsets of Z"' minimizing the Hausdorff distance to K. 

• = (UmcA^// (k) called the Hausdorff discretization of K. 

Property 5. Let K G then 

• is non void and finite, 

• VMi, Ma G MhAK), Ml U Ma G MhAK) and 

• ^hAK) G MhAK)- 

We will now characterize the Hausdorff discretization. 

Theorem 1. Let K G then 

• rniK, d) = inf{{r >0\K C Upezid(iy)^r (p)}); 

• ^hAK) d)(K) = {p&D \ A 0} and 

• rniK, d) < rAd). 

Property 6. Let K G H{MA, r G and let S C such that K C Bf(p) 
and Vp G S, Bffp) f] K A then HAK, S) < r. So, if r = vh{K, d) then 
SgMhAK). 

Corollary 2. Let d be a cellular metric. Then for every compact set K, if S is 
a covering discretization of K then S G MhAK)- 

The results obtained for the Hausdorff discretization in the lattice can be 
extended, in a natural way, to any lattice. In the following theorem, we prove 
that, the “Digital geometry” converges in Hausdorff space to the “Euclidean 
geometry” as in [DT 95] by using lattices with increasing resolution. 
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Definition 17. Let m € IN , the square lattice with step ^ is the set = 
2 ^^”. 

Proposition 6. Let K e Lf^m e IN, Ah,j{K, R^) is the Hausdorff 

discretization of K in the lattice Rm, then lim H,{K, Ah,{K, R„)) = 0. 



3.3 Hausdorff discretization and supercover discretization 

The supercover discretization operator A sc is defined by 

VK G Asc{K) = {pez^\Kn C{p) ^ 0}. 

In this subsection, we study the relationship between the supercover discretiza- 
tion and the Hausdorff discretization. We present a localization of the Hausdorff 
discretization. We have shown that the supercover discretization minimizes the 
Hausdoff distance iff the metric is cellular, and we prove also that the supercover 
discretization is the Hausdorff discretization iff the metric is proportional to d^o- 



Proposition 7. Let d be a metric induced by a norm, then \/K G 
AhAK) cAscim Z\sc«,(d)(0)). 

Theorem 2. Let d be a metric induced by a norm on IR". 

• d is a cellular 4=^ WK G H(IR^), Asc{K) G 

• VK G AhA^) =^sc{K) 4 =^ 3oiG ]R^ such that d=adoo- 

The topological properties of the supercover discretization are studed by several 
people. [Sch 98] shows that, given a Euclidean set K of IR^, under some condi- 
tions on k, points can be removed from its supercover discretization Asc{K), 
in such a way that, for the remaining subset S of points, Up6sC(l') i® homo- 
topic to K. [LCG 98] gives a sufficient conditions on K under wich Asc{K) is 
homeomorphic to K. 

4 Applications 

In all the following, we assume that all the metrics considered are induced by a 
homogeneous norm. 

Definition 18. A norm N on IR” is homogeneous if\/{xi, ■■■, a;„) G WA , 
V(ei, • • •, e„) G { — 1, 1}", for all permutations (7 , N{x\, ■ ■ ■, Xn) = N{e\Xi, ■ ■ ■, 
^nXn) = ■ ■ ■, Xa-[n))- 'S'o, if n = 2, then N is homogeneous iff'i{xi, 

X 2 ) G SA , N{xi, X 2 ) = N{—xi, X 2 ) = N{x 2 , x\). 

Definition 19. Let d be a metric on IR" and let p G IR" and r G SA , 
efip) = {a; € IR" 1 d{x, p) = r}. 
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Property 7. Let d be a metric on JR” induced by a homogeneous norm, then 

• H-1. 1}” C n Cf-{p)) and 

• |{— 1, 1}" C (C'rc(<i)(p) 3 Cte JR'*" such that d =adoo- 

• If K eH{M^),tLnrH{K, d) =rc{d) ^ RT n ({(i, • • •, i)} © ^”) 7^ 0 

Notations : 

Let d be a metric on and let p G 

• N{p) = lR^ \ 

• V8(p) = {qG I doo(p, q) = 1} and 

• V4(p) = {qG I di{p, q) = 1}. 

M{p), V 4 (p) and Vs{p) are illustrates in the Figures 1, 2. 




Fig. 1. N{0) for the metric d 2 - 



1 



I I I 

I I I 

(M 



Fig. 2. (a) correspond to V4(0), (b) correspond to Vs(0) 



Property 8. Let d be a metric on JR^ induced by a homogeneous norm, and let 
p ^ q G and r G JR^. 

• If r<rc{d), then {B^{p) n B^{p) yf 0 q G V^^p)), 

• If r = rc{d), then {Bf{p) n Bf{q) yf 0 q G Vs(p))- 

Property 9. Let d be a metric on JR^ induced by a homogeneous norm, and 
let p G then N{0) is an open subset of JR" and (A/’(0) = 0 4=^ 3 
aG such that d = Ctdi). 

Definition 20. Let d be a metric on M"' and K G 7d(JR"), the skeleton of 
^hAK) is the set 

Sk{K, d) = {qGZ^ I yS C Hd{K, S) = rniK, d) ^ q G S} 
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Definition 21. Let K be a connected subset of and S a square of Hdf , we 
say that K goes throught S if3p, q G such that p q, p, q & {Kr\{S\int{S) ) 
and [p, q] r]int{S) 7 ^ 0, where [p, q\ = {tp+ (1 — t)q \ 0 < t < 1} zs the segment 
of of end points p, q. 

Figure 3 present a curve goes throught a square and a curve not goes throught 
a square. 




(a) (b) 

Fig. 3. in (a) the curve goes throught the square, 
in (b) the curve not goes throught the square 



Property 1 0. Let d be a metric on IP? induced by a homogeneous norm, and let 
K G T-L{1P?). If K is connected and rn{K, d) <rc{d), then 

K goes through C{p) p G Sk{K, d) 



4.1 Hausdorff discretization and Bresenham discretization 

In this subsection, we compare the Hausdorff and Bresenham discretizations 
[Bres 65]. Actually, we prove that the Bresenham discretization of a segment of 
is not always a good discretization relatively to a Hausdorff metric. 

In all the following, we consider only the segments of IP? of the form {(a;, ax + 
6 ) I cc G [a, P]} where jaj < 1, a, /9g IP and P~a> 3. 

Definition 22. [Bre 65] 

Let L be a segment of IP? 

^Bres{h?) = {(b L®* + ^ + ) I L*^J — * — \ P']} where ( Wx € IP, \x\, 

[a;] G Z , [a;] < x<x + 1 and [x] — Kx < [x] ). ^Bres{h?) is called a 

Bresenham discretization of the segment C. 

Definition 23. Let d be a metric on IP? induced by a homogeneous norm, and 
let E = {(x, y) G | y = -x+l}. 

• A{d) is the point of E of minimal x-coordinate (i.e. if A{d) = (xq, yo), then 
xo = min{{x \ (x, — x + 1) G E}) ) 

• P{d) is the square with the set of vertex V{P) = {(xq, yo — 1), (— xq, yo — 1); 
(xq, 1 — yo), (—Xo, 1 — J/o)} P-e- P(d) is the convex hull ofV(P) ) 
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(a) (b) 

Fig. 4. (a) The black area represents A{d), (b) The black areas represent TZ{d)s. 

Property 11. Let d be a metric on IP? induced by a homogeneous norm, and let 
£ be a segment in IP^ . If rniC, d) = rdd), and C goes through C{p), then 

p G Sk{C, d) C f] {{p} (B int{TZ{d))) d ^ 

where int{TZ{d)) is the interior of the set TZ{d). 



Theorem 3. Let d be a metric on IP? induced by a homogeneous norm, and let 
C = {{x, ax + b) \ X G [ct, (3\\ be a segment in IP? . 

• I/rniP, d)<rc(d), then 

Hd{ABrea{C), £)>r//(£, d) 3p, q, r G q, r G V^p), r G (Vs(g) \ 

V 4 (g)) and C goes throught C{p), C{q), C{r). 

• d) = rdd), then 

Hd{ABres{C), C)>rH{C, d) 3p, q, r G E? q, r G Vdp), r G (Vs(g) \ 

V 4 (g)) and C goes throught {p} 0 TZ{d), {g} 0 P{d), {r} 0 P{d). 

A different cases of Theorem 12 are illustrated in the Figures 5, 6. 





Fig. 5. (a) and (b) illustrates a different cases of theorem 12 where rrf(£, d)<rc(d). 
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(a) (b) 



Fig. 6. (a) and (b) illustrates a different cases of theorem 12 where d) = rc{d). 



5 Conclusion 

We have introduced a new theory of discretization, based on the Hausdorff dis- 
tance. We have proved the convergence (in Hausdorff metric sens) of the dis- 
cretization to the original object when resolution tends to zero. We have com- 
pared the Hausdorff discretization to other discretization schemes such as the 
covering, the supercover and the Bresenham discretization. We have proved that 
the supercover discretization is Hausdorff discretization for all compact set k 
(i.e. /\sc{K) € Mhci{K)) iff the distance is cellular; and that for a distance d 
induced by a norm, Ah^{K) =Asc{K) for any compact set K iff the distance 
d is proportional to doo- We have olso proved that Bresenham discretization of 
a segment of Bd? is not always a good discretization relatively to a Hausdorff 
metric. 

Further investigations will be needed: 

• on the topological properties of Hausdorff discretization (cfr. [LCG 98], [Sch 
98]^); 

• on its extension to grey-level images; 

• on the discretization of geometrical and morphological operations. 
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Abstract. Keller’s Geometrical Theory of Diffraction [8] allows to ren- 
der scenes with dihedron diffraction account. The Diffraction algorithm 
presented in [2] is too slow, since its complexity is linear with respect 
to the number of dihedra. In order to accelerate it, we propose to re- 
duce the complexity with a discrete based algorithm. Considering that 
diffraction mainly occurs inside the n-first Fresnel’s ellipsoids [11], we 
can limit the diffraction computation to dihedra inside such ellipsoids. 
For efficiency we propose to use an ellipsoid approximation, the discrete 
tube. We describe two different algorithms for computing such a discrete 
tube. Their results are discussed, and show an important acceleration 
compared to the previous method. 

Keywords: Discrete Algorithm - Rendering techniques - Diffraction - 
GTD “ Ray- Tracing 



1 Introduction 

Diffraction is the electromagnetic phenomenon which explains the wave scat- 
tering in object vicinity [3]. At visible wavelength this effect is generally of low 
importance and is usually neglected. Some physical simulations however imply to 
compute the wave propagation as accurately as possible, e.g. in avionic system, 
or in architectural design. 

Since for high wavelengths diffraction is of great importance, an other demand 
comes from wireless phone providers : to ensure good communications at every 
location in their covering areas, they need to put many transceivers in cities. 
In order to reduce their number, they have to compute the transmission quality 
between a fixed transmitter and a mobile one (a wireless phone). Their goal is 
to minimise the fixed transceiver number while keeping a good city coverage. 

In order to render image synthesis scenes with diffraction account, we have 
proposed [2] a solution based on Keller’s Geometrical Theory of Diffraction [8] . 
This solution allows to compute energy transport with diffraction account not 
only at the camera like in Nakamae’s method [10], but also at each scene points. 
In image synthesis this solution allows for instance to compute shadows with 
more precision. Nervertheless our solution required an important computation 



G. Bertrand, M. Couprie, L. Perroton (Eds.): DGCI’99, LNCS 1568, pp. 413—424, 1999. 
(c) Springer- Verlag Berlin Heidelberg 1999 
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Fig. 1. Ray classification : direct rays are continuous, reflected rays are dotted 



time : at each point, computation complexity is linear with respect to the number 
of dihedra. Rendering complex scenes implies high computation time. Since the 
diffraction mainly occurs in the n-first Fresnel’s ellipsoids, we propose to use 
a regular grid and limit dihedron edge diffraction to a discrete tube which is 
the parallelepipedic cover of an ellipsoid. We choose to use such a tube since 
the computation of a discrete ellipsoid is much more complex and appears to be 
superfluous. 

After describing two algorithms that compute such discrete tubes, we present 
some results and discuss the acceleration factors we have obtained. 

2 The problem 

2.1 Diffraction and Ray- Tracing 

The Ray-Tracing algorithm [12] is based on classical optics which is a geometric 
theory where light is propagated in vacuum or in an homogeneous medium along 
straight lines called rays. The foundation of this theory is Fermat’s principle 
which assumes that light path lengths are extrema. A ray classification can be 
made by separating direct rays, which are straight lines between the source points 
and the receiver points, and reflected rays which are straight lines between the 
reflection points and the receiver points. Thus light paths can be composed of a 
single direct ray, or a direct and one or more reflected rays (cf. fig. 1). 

Classical optics does not allow to compute a realistic light propagation. In- 
deed the diffraction phenomenon, which explains light scattering in an object 
vicinity, cannot be rendered this way. Other theories, like Kirchhoff’s [3], or 
Lorentz’s microscopic one [7], allow to include such phenomena into simula- 
tions. They imply however the implementation of a new algorithm class, that 



The Discrete Tube 



415 



produces results at a cost of a very large computation time. Our experiences 
have shown that they have a very bad convergence (so computation time is ex- 
ceedingly large) and need high floating point precision. Therefore they can not 
be used directly in image synthesis because of their complexity. 

We have chosen an other approach in order to produce realistic images. In- 
deed, the introduction of the diffraction phenomenon in the ray-tracing algo- 
rithm [2] is based on Keller’s Geometrical Theory of Diffraction (or GTD) [8] 
[9]. This theory extends the Geometrical Optics with a new class of rays, the 
diffracted rays, which are emitted by points localized on the object outline called 
the diffraction points. The foundation of this model is the extension of Fermat’s 
principle. It assumes that light paths, which can be composed of a direct ray, 
or a direct and a combination of reflected rays and diffracted rays, still have ex- 
tremum lengths. With a direct application of the extended Fermat’s principle, 
such paths can be found by minimising the light paths for simple geometrical 
shapes, like dihedra [2]. 

With this theory, the diffraction treatment can be added into the ray-tracing 
algorithm [12]. More precisely, we can compute the direct diffracted illumination 
at a point by finding all the diffracted paths (with only diffraction points, no 
reflection point) between a source point and a receiver one. This computation is 
done in the Whitted’s shade [12] [6] function, the goal of which is to evaluate 
the radiance received at a given point. 

2.2 Complexity of the Diffraction Algorithm 

The Diffraction Algorithm [2] can be written as : 



1 For all ordered combination of n dihedron edges do 

2 Compute the corresponding light path 

3 If this path exist 

4 Compute the carried radiance 

5 Endlf 
6 EndFor 



Fig. 2. The diffraction algorithm 



The main advantage of such a solution is to be faster than a basic Monte- 
Gar lo approach based on Kirchhoff’s Theory [3]. However, the use of Keller’s 
theory implies large computation even with low complexity scenes. The profil- 
ing of our diffraction algorithm shows that we spend an important part of the 
time in the search of diffracted paths. As for Whitted’s ray-tracing algorithm, 
the complexity is directly related to the number of scene objects. This appears 
obvious if we consider the first line of the diffraction algorithm (fig. 2). In order 
to produce realistic images with diffraction account, we have to accelerate this 
algorithm. 
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The first idea is to use a similar acceleration method than for the Ray- 
Tracing algorithm, i.t. an algorithm based on a spatial structure (e. 5 . octree, 
BSP, regular grid ...). Such an algorithm works for the classical Ray- Tracing 
because the light transport between two reflection points is a straight line. In 
the case of diffraction computations, the light transport between two points is 
more complex. Therefore this method can not be used directly in our diffraction 
algorithm. 

Our idea is to search only for the diffraction paths that are inside an ellipsoid. 
Indeed with the Huygens-Fresnel construction [3], it can be shown that the 
main part of the electromagnetic energy is propagated into the n-flrst Fresnel’s 
ellipsoids (95% in the first one). Moreover, it is easy to see that the GTD formu- 
lation keeps these properties. In our diffraction algorithm, we propose to limit 
the computations between a point source and a receiver point to paths that are 
strictly inside such an ellipsoid. This can be done since we have the following 
restriction : it is only necessary to consider path parts with no reflection point ; 
indeed, in this case such a path can transport a very important radiance whereas 
at least one of these points are not in the first ellipsoids. 



2.3 Towards a discrete solution 

We have seen that we can limit diffraction computation to dihedra that are inside 
an ellipsoid. This implies a small error on the received radiance, but we assume 
that, if we take for instance the ten first Fresnel’s ellipsoids, this error will 
be invisible for the human eye. The naive method for such a limitation is based 
on the analytical finding of the intersection of a dihedron and such an ellipsoid. 
There are two reasons that make it a bad solution. Firstly it implies to work 
with all the dihedra, so the complexity will be roughly the same than for the old 
Diffraction Algorithm. Secondly the cost of the analytical intersection between 
a line and an ellipsoid is close to the cost of the algorithm which computes and 
verifies the existence of a diffracted path. Our goal is to work with as few dihedra 
as possible in order to reduce the diffraction treatment complexity. 

We thus propose to store all the dihedra into a regular grid. This may allow 
us to efficiently find those that are important in the computation of the light 
propagation between two points. By finding the intersection of the ellipsoid and 
the regular grid, we will reduce the number of dihedra involved in the diffraction 
transport. Thus, we will get an acceleration for our diffraction algorithm. 

Our problem can be sumed up as the ability to compute the intersection of 
an ellipsoid and a regular grid as fast as possible. This problem is equivalent 
to finding the most efficient discretization of an ellipsoid. Because our goal is 
to produce an acceleration for the diffraction algorithm, instead of computing a 
discrete ellipsoid, we propose to compute a discrete parallelepipedic cover that 
we call the “discrete tube”. This is simpler to compute and thus faster, and it 
is easy to see that this does not mean more errors in the diffracted radiance 
computation since we may only get more dihedra, and thus more precision than 
with the actual ellipsoid. 
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3 The discrete tube 

In this section we present two algorithms that allow us to compute such a tube. 
Both their complexities are quite similar, but the second one is slightly faster. 
However a modified version of the first algorithm could be used for the accelera- 
tion of beam tracing [5] . These algorithms are based on the efficient computation 
of a 3D six-connected line [4]. 

3.1 First algorithm : Discrete Line Scanning (DLS) 

The main idea here is to construct discrete rectangles which have to be, firstly, 
centered on each voxel of a 6-connected discrete line and, secondly, orthogonal 
to this line. If we do that for all the voxels of a 6-connected discrete line segment, 
we obtain a discrete tube. For this construction we have to define the discrete 
plane that will become the support of such rectangles. As we will see later, we 
choose to use the naive plane, which is a particular case of the discrete analytical 
plane [1], for an efficient rectangle construction : 

Definition 1. (3D Discrete analytical plane [1]) A 3D discrete analytical plane 
P = P 3 {ao , . . . , « 3 , w) is the set of points X = {xi,X 2 , x^y € that verify this 
equation : 



3 

0 < ao + ^ OiiXi < to, 

i=l 

where u) G IN* is called the arithmetical thickness, ( 01 , 02 , 0 ^ 3 ) G o,re called 
the coefficients, ao G ^ is called the translation constant of the plane, with 
gcd{ai,a2, 03 ) = 1 - 



Definition 2. (Naive plane [1]) A 3D plane is called a 3D naive plane if to = 

maxi=i„.3(|ai|). 

Such a naive plane has an interesting property that will ensure we do not 
miss any discrete tube voxel. Indeed, a proposition from E. Andres states that 
the discrete 3D planes (and so the 3D naive planes) tile the space : 

Proposition 1. (Tiling ofZ.^ [1]) 

+ 00 

= y Po{ao + tuj,ai,a2,ao,uj). 

t— — oo 

Moreover, since 3D planes do not have 6-tunnels, by scanning the voxels of the 6- 
connected line, we ensure that we do not miss some planes, and so some discrete 
tube voxels. 

All these theoretical bases allow us to write our algorithm. After an initialisa- 
tion stage, an incremental part does the scanning of the discrete 3D line between 
the transmitter point E and the receiver point R. 
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Initialisation stage In the stage, we first have to initialize the computation 
of the line segment between the points E and R. This is equivalent to the well- 
known algorithm for drawing such a discrete line [4] . 

Next we continue with the preparation of the computation of the successive 
parallel naive planes Pi. We thus determine the first voxel Vq = (xi,X 2 ,x°)'^ 
that contains the point E. 

Pq is the naive plane perpendicular to the direction ER which includes Vq : 
its coefficients (ai,a 2 ,cT 3 ) are in a first step set to the discrete coordinates 
(a, b, c) of the vector ER ; in a second step they are reduced in such a way that 
gcd(ai, 02 , 0 : 3 ) = 1. The translation coefficient of Pq is easy to compute since 
the voxel Vq belong to Pq 1 indeed we have Og = — The arithmetical 

thickness to of Pq is computed according to the definition 2 . 



Incremental stage This stage is based on the 3D 6 -connected discrete line 
construction. 

For each voxel Vi of our discrete line C, we compute the naive plane Pi 
which, firstly, contains Vi, and, secondly, is parallel to the first naive plane Pq 
defined above This plane has the same coefficients than Pg. It differs only by 
its translation coefficient ag (see proposition 1). We have : 

f 1 if I", ^ P,_i 
( 0 otherwise. 

If the intersection of the naive planes Pi and Pi+i is null, we have to compute 
the current rectangle. Assuming that the largest coefficient of our naive planes 
is «i, the rectangle projection on the (y, 2 :)-plane allows us to compute it easily^. 
The rectangle is centered on the voxel Vi. We compute the projected rectangle 
width as : 



with 



Vi > 0 , 



5 ^ = 



width 



Il^ll 



where du is the discrete tube diameter, is the x-coordinate of the IR^-vector 
ER, and the braces denote the ceiling function^. Since the rectangle belongs to 
a naive plane, and since the arithmetical thickness to is equal to a\, each pixel of 
our projected rectangle leads to its corresponding 3D voxel in a non-ambiguous 
manner. We have : 

^ It is important to notice that two consecutive naive plane Pi and Pi+i based on 
two consecutive voxels can be identical. So the recursive algorithm has to work both 
on the discrete line voxel Vj and on the successive naive planes term t defined in 
proposition 1 . 

^ Construction is similar with 02 or 03 equal to max(|ai|, |a2|, |«3|). 

® The ceiling function IR Z returns [x] = inf(n £ IN | n >= x) 
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a 2 - y + az- z + u 
Oil 



where the braces denote the floor function‘d. We get all the x-coordinates with a 
double loop on the y and z coordinates. 

It is important to notice that this method for computing such rectangles works 
well since we choose to use naive planes. With a greater arithmetical thickness, 
e.g. for standard planes, the previous method cannot be used since for some 
couples (y,z) there are more than one solution for x. 



3.2 Second algorithm : Discrete Rectangle Scanning (DRS) 

The second discrete tube generation algorithm is a variation of the previous one. 
The main idea here is to extrude the first rectangle centered at Vq along the 
6-connected discrete line [4] that passes through the transmitter point E and 
the receiver point R. 



Initialisation stage It begins as for the DLS algorithm by the definition of the 
naive plane Pq. We then compute the 6-connected discrete line segment (OP) 
which is the translation of the discrete line segment {ER) to the coordinates 
system origin {i.e. OP = ER). Next we store® the result in a voxel list C. Such 
a voxel list can be considered as a displacement list. 



Incremental stage For each voxel of the naive plane Pq we compute the voxels 
corresponding to the translated line C. 

The incremental stage is really easy to implement : this explains that this second 
algorithm is a little more efficient than the previous one. An important question 
must be asked : did we miss, with this procedure, some voxels of the discrete 
tube ? In fact we do not miss any voxel, since we use a 6-connected line and a 
naive plane. Indeed the intersection of a naive plane and a 6-connected line is 
at least one voxel [1]. Our algorithm can be considered as a rectangle extrusion. 
On the contrary, some voxels are identified twice or more. Fortunately it is 
not a problem : the regular grid mapping of dihedron edges and the diffraction 
treatment imply to mark the edges that are inside a discrete tube. Indeed an 
edge can be included in two or more discrete tube voxels, but needs only to be 
treated once in the diffraction algorithm. 

^ The floor fonction IR Z returns [a;J = sup(n £ IN | n <= x) 

® A possible way for optimizing an algorithm is to minimize the dynamic memory 
allocation. Fortunately it can be noticed that the regular grid size is fixed. Thus it is 
possible to allocate once and for all this voxel list memory if we consider the greatest 
6-connected discrete line : the maximum number of voxels for such a line is smaller 
or equal to -I- Sj, -I- , where (Sj,, Sy, s^) are the side lengths of the grid. 
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Algorithm 


Total time 


Diffraction Computation 


Acceleration 


Classical 


6h 57’ 


6h 41’ 


- 


DLS 


51’ 


35’ 


11.5 


DRS 


42’ 


26’ 


15.5 



Table 1. Comparison between the three Diffraetion Algorithms for the interior scene 



Algorithm 


Total time 


Diffraction Computation 


Acceleration 


Classical 


12h 22’ 


12h 18’ 


- 


DLS 


3’ 33” 


1” 


44300 


DRS 


3’ 33” 


1” 


44300 



Table 2. Comparison between the three Diffraction Algorithms for the car scene 



4 Discussion 

These two algorithms have been implemented in our Ray- Tracing software. For 
the discrete tubes diameter dr, we use a function of the ellipsoid diameter, which 
depends on the shortest distance d{E, R) between the transmitter point E and 
the receiver point R, the wavelength A of the propagated wave (we take the 
maximum of the light wavelengths, i.e. A = maxi(Ai) ~ 780 • and the 

power <P radiated at E : 



4r . . I 1 L 

— = 1 -|- mm I 1 , — 



d{E, R) 



where s is the grid size. We chose such a solution in order to ensure the fact that 
we do not miss any continuous tube voxels. 

Our experiences show that with n = 1 we obtain a good realism. In order 
to ensure to take almost 100 percent of the diffracted radiance in all cases, we 
chose to take n = 10 in our implementation (so the ten first ellipsoids). As our 
images show (c/. figure 3), this approximation is good since the human eye can 
not see any difference® between the first one which is computed with the classical 
diffraction algorithm, and the second one which is computed with our spatial 
acceleration (DRS and DLS algorithms produce the same results). 

The raison d’etre of these two algorithms is to get an acceleration of the 
diffraction treatment for a polygonal scene. To study if such an acceleration is 
obtained, we tested it on different images. The table 1 shows the results for a 
simple scene, the interior scene (see figure 3). In this case the acceleration factors 
for our two algorithms are greater than 10. They are relatively small since, 
firstly, they are only 346 dihedra, and, secondly, the discrete tube diameters 

® In fact this difference is below 1 percents 
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Fig. 3. The interior scene rendered with (top-right) and without (bottom-right) grid 
acceleration. We can observe on the wall diffraction by the sun through a shutter. We 
can observe on the two right images that there are no difference between the previous 
and the new result. 



are relatively large. Indeed the diffraction is due to the sun, which has a great 
radiance power ; moreover the distances between the transmitter points and the 
receiver points are very large since the room lies on earth. 

An other test scene (see figure 4) is composed of a car in a closed room. The 
light transmitter is a lamp inside the room, of relatively poor power (60 Watts). 
The scene contains 10476 polygons and 15710 dihedra. As the table 2 shows, our 
algorithms produce an impressive acceleration. The reason of such an accelera- 
tion is very simple : the power emitted, the distances between the transmitters 
and the receivers, and the light power are small ; so the tube diameters are very 
small too ; therefore we avoid most of the dihedron edges at each diffraction 
computation because the car is a very complex object, composed of more than 
ten thousand polygons. 

In fact when a scene is composed of many polygons, a regular grid and the 
discrete tube algorithm lead to a really important acceleration factor. 

In order to choose between the two proposed algorithms in an implemen- 
tation, we need to discuss on which is the fastest one. The figure 5 shows the 
comparisons of these two algorithms with the interior scene (see figure 3). The 
x-axis denotes the grid size. It varies between 1 to 100. An attentive reading 
of this figure shows that the DRS algorithm becomes faster than the DLS one 
when the grid size becomes larger than 35. Therefore the authors prefer to use 
the DRS algorithm for its efficiency. In terms of implementation time, this al- 
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Fig. 4. The car scene, composed of 10476 polygons which lead to 15710 dihedra. 




Fig. 5. Computation times for the interior seene for various grid dimension. Image size 
is 512 X 512. One ray per pixel. The x-axis denotes the grid size. The y-axis denotes the 
computation time, in seconds. The continuous lines denotes the DLS algorithm. The 
dotted lines denotes the DRS algorithm. 
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gorithm seems also to be the better choice : it is simpler to implement than the 
DLS one. The problem of the DRS algorithm behaviour is that we can not use 
an infinit grid size since computer memories have limits. 

5 Conclusion 

Diffraction is an electromagnetic phenomenon usually ignored in image synthesis. 
People considered that it was too complex for a low visual importance. Never- 
theless some situations imply to render this phenomenon. For that purpose, we 
have proposed a solution in [2], that we call the Diffraction Algorithm. This 
method has however a major problem : it is very computation time consuming. 

In this paper we present two new algorithms that make diffraction in im- 
age synthesis available at a reasonnable computation time cost. They are based 
on discrete considerations, and their results are quite similar to the classical 
Diffraction Algorithm. With respect to this last one, they show an acceleration 
factor that can be really large when diffraction is of low importance in the scene. 
As a conclusion, it is possible now to include the diffraction treatment into a 
Ray- Tracing based software : when the diffraction phenomenon is visible in the 
scene, like for instance in the interior scene, our two algorithms produce good 
results at a reasonnable cost with respect to the Ray-Tracing cost ; on the other 
hand when the diffraction account do not induce any more realism, fortunately 
our new algorithms induce a negligible computation time. 

Even though we recommend the use of the DRS algorithm, the DLS one can 
be extended to produce an acceleration of the pyramidal beam tracing [5]. In 
order to do this, and like for a classical Ray-Tracing method, we need to find 
the intersection of a pyramidal beam with the first scene object it meets. The 
classical method is to test it for all the scene objects. As D. Ghazanfarpour 
has shown in [5], a better solution is to compute a discrete beam. His method 
is not based on the discretization of such a beam, but on a grid manipulation 
that handles too many voxels, and so too many object intersection tests. Authors 
think that the DLS algorithm can be modified in order to compute such a discrete 
beam. An important advantage of such a construction would be to detect when 
an object completely occludes the beam : in these cases it is not interesting to 
make intersection tests with objects behind the occluder. Moreover this solution 
leads to less voxel manipulations, so it may be faster. 
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Abstract. A voxel object (finite set of voxels) is considered in the cuber- 
ille approach (more precisely, the 3D cell complex approach). Its bound- 
ary is a set of surfels (faces of voxels). We assume, without loss of gener- 
ality, that this set of surfels is a polyhedron whose faces are surfels. These 
faces can be agglomerated in such a way that the boundary is a poly- 
hedron whose faces are topological disks of standard arithmetic planes; 
this new kind of polyhedron is called a discrete standard polyhedron. 
Thus, these new faces are generally much bigger than one surfel, and a 
discrete standard polyhedron has generally a less smaller space complex- 
ity than the starting set of surfels. This process, called polyhedrization 
or facetization, is the 3D extension of the known polygonalization of 
2D discrete curves. The other main properties of this polyhedrization 
are the non-uniqueness, and the reversibility, i.e. starting from the dis- 
crete standard polyhedron, the boundary can be exactly computed back 
again. A polyhedrization algorithm is presented in this paper. It uses a 
recent algorithm for recognizing standard arithmetic planes. Examples 
of polyhedrizations of synthetic and natural objects are given. Examples 
of application to the visualization of the boundary of a voxel object are 
also given. 



1 Introduction 

Let us consider a finite set of voxels, called a voxel object. A voxel is a unit cube, 
or 3-cell in the 3D cell complex approach of Kovalevsky [10] [11], centered at a 
point of integer coordinates. A face of a voxel is called a surfel (which is a unit 
square), or 2-cell, an edge is called Uriel, or 1-cell, and a vertex is called pointel, 
or 0-cell. For technical purposes in the present paper, a voxel is translated so 
that it is centered at a point of half integer coordinates; thus, any pointel have 
integer coordinates and any integer coordinates point is a pointel. The (surfel) 
boundary of a voxel object is the set of surfels incident to a voxel of the object 
and to a voxel of its complementary. This is the well known boundary concept 
in the ’’cuberille” approach of 3D discrete imaging, and of the cell complex ap- 
proach. We assume, without loss of generality, this boundary is connected, and, 
moreover, it is a polyhedron whose faces are surfels; that is, the boundary is 
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a connected combinatorial 2-manifold without boundary whose faces are unit 
squares (each linel of the boundary is incident to exactly two surfels, and each 
pointel of the boundary is incident to exactly one umbrella of surfels, see defi- 
nitions in [6] [7] [8]). But we know from [7] [8] that a standard arithmetic plane 
is an infinite orientable combinatorial 2-manifold without boundary whose faces 
are unit squares. Thus, the boundary of a voxel object can be decomposed (or 
encoded) into a set of topological disks of standard planes; that is, this boundary 
can be decomposed into a new kind of polyhedron, here called discrete standard 
polyhedron : A polyhedron whose vertices are pointels, and whose faces are not 
always exactly Euclidean planar segments, but always topological disks of stan- 
dard arithmetic planes. 

This decomposition (or encoding) is called polyhedrization, or facetization. It 
is obviously non unique. It is reversible, i.e. the boundary can be exactly back 
computed by starting from the discrete standard polyhedron. This reversibility 
property is not difficult to prove; but it is not addressed here. Note that it 
is not the case for the various methods, e.g. the ’’marching cube” algorithm, 
that produce a Euclidean polyhedron approximation of the boundary. Another 
important property of the discrete standard polyhedron is that it generally needs 
less information than a set of surfels for representing the boundary. 

The authors of [1] were the first to address themselves to the same problem. 
Another polyhedrization (of a discrete sphere only) was done in [4] [5], with 
naive arithmetic planes instead of standard planes. The solution of [1] was not 
entirely satisfactory, partly because it was not completely discrete (it uses a 
classical least square fit algorithm) . The solution given in the present paper uses 
the standard plane recognition algorithm presented in [9]; this algorithm makes 
sure that a given set of pointels belongs to an arithmetic plane and computes 
the parameters of the equation of that plane. 

Note that the problem of polyhedrization of a set of surfels is a 3D extension 
of the well studied problem of polygonalization of a 2D discrete curve; see [2], 
[5], [15], [16], [11], and their references. The present paper has the same goals 
for 3D images than [11] for 2D images; thus, it is a partial 3D extension of [11]; 
in [11], the study of the reversibility and of the complexity reduction was well 
developed; it is not the case here, and left to another paper. In the present paper, 
we restrict ourselves to the study of the feasibility of our polyhedrization. 

In section 2 the definitions and the basic used properties are given. In sec- 
tion 3 the problem is precisely stated, and the designed polyhedrization algo- 
rithm is presented and discussed. Experimental results on synthetic as well as a 
natural object are given in section 4. Application to the rendering of the bound- 
ary of voxel objects is shown in section 5. A prospective conclusion ends the 
paper. 

2 Definitions and basic properties 

The notion of surface used in the present paper is that of a two-dimensional com- 
binatorial manifold in the classical combinatorial topology [6] . The basic notions 
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are that of combinatorial 2-manifold without boundary (or closed), 2-manifold 
with boundary (or open), whose elements are vertices, edges, faces, their inci- 
dence relations, their adjacency relations, umbrella, loop or boundary of a face, 
connectivity, orientability. We use here these notions, and the notion of a topolog- 
ical disk which is an orientable combinatorial 2-manifold with a boundary of one 
loop only (or equivalently, of Euler characteristics one) . A face is a topological 
disk. 

An arithmetic plane is the set of points {x, y, z) of the discrete space 
satisfying the inequalities 



p, < ax -\- by -\- cz < II -\- oj 

where all parameters are integers and w > 0. The parameter uj is called the 
{arithmetic) width of the plane. The triple (a,b,c) is the normal to the plane; 
a,b,c,p are the coefficients of the plane. These planes are basic in the arithmetical 
geometry introduced by Jean-Pierre Reveilles [14]. A standard arithmetic plane 
is an arithmetic plane II of width w = jaj -I- |6| -I- jcj. Let us recall the result 
of [7] [8]. Let G be the graph of the 6-connectivity relation in 77, let F be the 
set of squares, i.e. cycles of 4 vertices, of G. The pair (G, F) is an orientable 
two-dimensional combinatorial 6-connected 2-manifold without boundary. 

We call discrete standard polyhedron a combinatorial orientable 2-manifold 
without boundary whose any face is a topological disk of a standard arithmetic 
plane, called a standard face. The vertices of a discrete standard polyhedron 
are integer points (here pointels), and called standard vertices', its edges are 6- 
connected paths of points called standard edges, whose extremities are standard 
vertices and not containing other standard vertices. The figure 1 shows a view 
of the standard edges of a polyhedrization of a discrete hexahedron, which is a 
two times chamfered cube; the linels of the boundary of the discrete standard 
polyhedron are displayed like 3D unit segments. 



Fig. 1. Hexahedron 
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3 Polyhedrization: Problem and algorithm 

Let V be a voxel object whose voxels are centered at half-integer coordinates 
points, and let B its boundary (see definitions in section 1). Thus the pointels 
of B have integer coordinates only. Assume B is connected (if not, we treat 
separatively its connected components). Moreover, assume B is a polyhedron 
whose faces are surfels, that is a combinatorial 2-manifold without boundary 
whose faces are surfels and whose vertices are pointels (generally, a linel of B 
can be incident to 4 surfels of B, instead of 2, and a pointel of B can be incident 
to 2 umbrellas of surfels of B, instead of just one). This last assumption is made 
in order to simplify this paper, and can easily be dropped by different methods 
discussed in [12]. Because any surfel is a topological disk of a standard arithmetic 
plane (an orthotropic plane), B is a discrete standard polyhedron whose standard 
faces are surfels, standard vertices are pointels, and standard edges are linels. 
The problem called polyhedrization is to agglomerate surfels of B in order to 
make a discrete standard polyhedron with larger faces. The reduction of space 
complexity is obvious, as well the non uniqueness of the solution. 

A simple, nevertheless instructive, example is that of an object of one voxel 
only, centered at (|j |)- Its boundary has 6 surfels; the 3 ones incident to the 

pointel (0,0,0) (resp. the pointel (1,1,1)) define an umbrella of the standard 
plane 

Q^x + y + zKi {resp. 0<x-|-y-l-2:— 1<3). 

Thus, the boundary of the one voxel object can be decomposed into two 
topological disks of standard planes (note that it is not possible for a Euclidean 
polyhedron to be reduced to two faces only) ; moreover, these planes are different 
and parallel (same normal), which is not possible for two adjacent faces of a 
Euclidean polyhedron. 

The design of a polyhedrization algorithm is rather straightforward. Here, B 
is a connected set of surfels. The set of faces of the searched discrete standard 
polyhedron is denoted by P. A face of P is represented by the quadruple of the 
coefficients of its standard plane, and by its boundary which is a loop of integer 
points (pointels). Any surfel of B must belong to one and only one face of P. 
The problem is : Given B, compute P. 

The (standard) faces of P have to be computed one by one, starting from a 
seed of one surfel (but for heuristic reasons we prefer to start with a seed of one 
umbrella, as long as possible). Thus some traversing through the surfels of B 
has to be done. For each surfel of B we try to affect it to the current standard 
face under construction f. If we can’t add any surfel of B to f, we start a new 
current face with (arbitrarily) the next non affected surfel with respect to the 
order of traversal of B. In order to assign the current surfel s to the current 
standard face f we must check whether by adding s to f 

1. we keep a topological disk; 

2. the pointels of s and of f belong to a standard plane. 

The answer to the first condition is obtained by checking whether the boundary 
of the union of s and f is a loop; it is easy and purely topological. The answer 
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to the second condition is obtained by the standard plane recognition algorithm 
presented in [9], based on the known Fourier’s algorithm for solving a set of 
inequalities. This algorithm provides not only a yes or no answer, but also the 
cone of the solutions, which is a set of inequalities for the coefficients of the 
standard plane solution, if it exists. Moreover, a numerical solution can be easily 
computed, especially a numerical normal, which is practically needed. 

Because of the first condition, the current surfel s is chosen in the set q(f) 
of non assigned surfels that are not in f and are adjacent to a surfel of f (thus 
a linel of s belongs to the boundary of f). If s is affected to f defining a new 
current face f’, the update of q(f) to compute q(f’) is by deletion of s from 
q(f) and by adjonction of a set t, maybe empty, of surfels. The set t is the set 
of all surfels adjacent to s that are not in f, and non assigned. To each surfel of 
t is associated an integer, called its depth, which is one more the maximal depth 
in q(f) (the surfels of q(g), where g is the seed, are initialized to 1). We have 
decided to constraint the choice of the current s in q(f) to be restricted to those 
surfels of minimal depth, a kind of breadth first search; in this way, the growing 
of the current face looks like the propagation of a circular wave. 

The output of the algorithm is a set of standard faces; each standard face is 
defined by its standard plane and by a circular permutation of the linels (or the 
pointels) of its boundary. These linels (resp. pointels) can be considered as the 
standard edges (resp. vertices) of the computed polyhedron. Another possible 
way is to call standard vertices only those previous vertices that are incident to 
at least three standard faces; then, a standard edge is a path of linels bounding 
two adjacent standard faces, ending with two standard vertices, and not incident 
to other standard vertices. 

It is clear that the computed P heavily depends on several arbitrary choices : 
The ordering of the traversing through B, the chosen seed for starting a new 
current face, the chosen surfel s in q, for enlarging the current face. Several 
questions occur naturally: How is it possible to control the number of faces of 
P, or the sizes of the faces, or to eliminate the smaller faces, ..., by acting on 
these choices. We do not know any answer. 

For some practical use in medical imaging (Philippe Borianne, private com- 
munication), the limitation of the size of the standard faces could be required; 
another interesting limitation is that of the depth of the surfels in q. This can 
be easily done. All details of the algorithm are given in [12]. More constrainted 
variants are given in [12], and not detailed here; one variant is in the spirit of 
[16], in order to get a better retrieval of voxel objects defined by a discretization 
on a Euclidean polyhedron. 



4 Results 

The polyhedrization algorithm of section 3 was programmed and experimented. 
Some results are now given and discussed. All the figures are obtained by the 
classical methods of rendering of polygons (fiat shading or Gouraud shading with 
a one light illumination model); all the surfels of a standard face are displayed 
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Fig. 2. Hexahedron 



Fig. 3. Sphere (R=15) 



with the same color, more or less darkened by the used model; adjacent standard 
faces have different colors. 

The figure 2 shows the standard faces obtained for a discrete hexahedron 
which is a twice chamfered cube (the voxel object is the intersection of six half- 
spaces). In this example, the number of computed standard faces is not 6, but 11. 
The number of surfels (size) of the boundary is 1962; thus, the average size of 
the standard faces is 178.36; the maximum size is 475, the minimum is 13. 

The same is performed in figure 3 for a discrete sphere of radius 15 (the voxel 
object is a bowl), and in figure 4 for the boundary of a X-ray 3D scanner image 
of a mushroom. 

For the sphere, the 48 symmetries are lost. The number of computed standard 
faces is 135. The number of surfels of the boundary is 4494; thus, the average 
size of the standard faces is 33.29; the maximum size is 94, the minimum is one; 
there are 48 standard faces of size 5 or less; the distribution of sizes is rather 
scattered. 

For the mushroom, the number of computed standard faces is 1019. The 
number of surfels of the boundary is 26392; thus, the average size of the standard 
faces is 25.90; the maximum size is 332, the minimum is one; there are 475 
standard faces of size 5 or less; the distribution of the sizes is rather scattered. 

A general observation is that the number of small standard faces is rather 
large. Decreasing this number, if possible, is an open problem. 

5 Application to visualization 

The surfel boundary of a voxel object have been rendered, in section 4, like 
a set of Euclidean polygons, by the classical methods of rendering of computer 
graphics, because surfels are 3D Euclidean polygons. Methods for smoothing the 
resulting visual staircase effects have been studied (see [13] for references and a 
new method) . The same is true for a discrete standard polyhedron, although the 
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Fig. 4. Mushroom 



Fig. 5. Sphere (R=15) standard faces 
with flat shading 



vertices of a standard face do not necessary all belong to a Euclidean plane; this 
is because some classical softwares of rendering begin by a triangulation of the 
entered polygons, and then display each obtained triangle by using an entered 
or a computed normal which can differ from the true normal of the triangle. 
Thus, it is possible to use these softwares with a standard face represented by a 
normal and a sequence of pointels of its boundary. 

We present here a few results (obtained with the rendering library OpenGL), 
and a short discussion. The practical interest of this new way of visualization of 
discrete 3D data requires a much more developed study, which will be done in 
another full paper. 

Figure 5 (resp. 8, 11) shows a flat shading view of the radius 15 discrete sphere 
(resp. the mushroom, the hexahedron). The linel length is large enough for show- 
ing up the non Euclidean planarity of the standard faces. Obviously, the visual 
defects disappear if the length of a linel is about that of a pixel of the screen. 

In order to avoid some possible defects of the previous visualization method, 
the following variant has been experimented. Instead of entering the rendering 
software with the full standard edges of a standard face, we replace any standard 
edge by the Euclidean 3D segment deflned by the two extremities of the edge; 
in other words, we enter the rendering software with, for each standard face, a 
cyclic permutations of its standard vertices. The obtained polyhedron is called a 
semi discrete polyhedron. Note that its faces are generally not Euclidean planar. 

Figures 6 and 7 (resp. 9 and 10) show respectively a flat and a Gouraud 
shading view of the semi discrete polyhedron of the radius 15 discrete sphere 
(resp. the mushroom). 

The more constrainted variant (see end of section 3) designed for polyhedra 
provides better results than those of figure 2. Figure 1 shows a vectorial view 
ofthe computed standard polyhedron, that have the same number of faces, edges, 
and vertices as the starting Euclidean polyhedron. In this case the reduction of 
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Fig. 6. Sphere (R=15) semi discrete Fig. 7. Sphere (R=15) semi discrete 

faces with flat shading faces with Gouraud shading 





Fig. 8. Mushroom standard faces with 
flat shading 



Fig. 9. Mnshroom semi discrete faces 
with flat shading 




Fig. 10. Mushroom semi discrete faces with Gouraud shading 
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Fig. 11. Hexahedron standard faces 
with flat shading 



Fig. 12. Hexahedron semi discrete 
faces with flat shading 



complexity of the polyhedrization is maximal. Figure 11 shows a flat shading 
view of this standard polyhedron, and figure 12 shows a flat shading view of the 
associated semi discrete polyhedron; this last result can be said very satisfactory. 

6 Conclusion 

We have defined a discrete standard polyhedron, a new kind of polyhedron, with 
faces that are planar for standard arithmetic planes. We have designed and ex- 
perimented a polyhedrization algorithm that produces a discrete standard poly- 
hedron, starting from the surfel boundary of a voxel object. Two properties of 
this process are very interesting: Reduction of space complexity, and reversibility. 
Examples of application to the visualization of a voxel object have been given. 
All these results are quite encouraging for further progress. 

Now, a lot of future work has to be done: 

— Theoretical and experimental study of the space complexity reduction; 

— Design of reversion algorithms; 

— Extension to the polyhedrization (encoding) of a discrete scene, and of all 
the objects of a 3D image (as was done for 2D images in [11]); 

— Exploration of the numerous variants of the algorithm (varying the arbi- 
trary decisions, defining more or less supplementary constraints), and their 
consequences on the statistics of the faces, and on their geometric properties; 

— Full development of the visualization of discrete objects; 

— Development of other applications, especially the use of discrete standard 
polyhedra like the use of Euclidean polyhedra in the held of topology based 
geometric modeling and in CAD, their use in discrete modelers, in medical 
imaging ... 

A part of this future work is in progress. 
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Abstract. In this paper, we present a new method of discrete Ray- 
Casting, using parallel rays, having the property that each point of the 
scene belongs to one and only one ray. This study is based on the theory 
of arithmetic discrete geometry. The rays are modeled by Figueiredo- 
Reveilles’ discrete naive 3D lines that tile the space and that can be 
incrementally generated. 
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1 Introduction 

Various fields of science and industry have become intensive users of advanced 
volume visualization tools. Medical imaging, for example, needs more and more 
precise and realistic views of anatomic volume data. 

The usual 3D visualization methodology consists in the following steps: First, 
raw 3D data, as a result of image acquisition, is geometrically interpolated, 
and filtered for noise reduction, before being submitted to data segmentation 
algorithms in order to retain only useful information. The next step consists in 
selecting a visualization method amongst one of the following usual classes of 
visualization algorithms: Isosurface reconstruction for surface-based rendering 
(introduced by [15] and [18]) or voxel based rendering. 

The algorithm described in this paper is a voxel based front to back algo- 
rithm, i.e. we scan the data space from the nearest voxel to the farthest. This 
class of volume rendering algorithms has been introduced by [9]. The theoretical 
background to our algorithm is fundamental results from Arithmetic Discrete 
Geometry, mainly due to [17]. The originality of this approach is the exclusive 
and intensive use of integer arithmetic which allows to handle a large amount of 
data faster than usual methods. The discrete approach has already been used in 
the past [19,14], its drawbacks and advantages have been analyzed in [5,6]. 

The originality of our algorithm is the use of Figueiredo- Reveilles [7] discrete 
lines, which are the thinnest possible 26-connected lines, that allow, through a 
3D tessellating theorem (new result), to reach all the visible voxels exactly once, 
without losses or doublings. 

The resulting discrete ray casting algorithm, can be sketched informally as: 

* Most of this development has been achieved at the LSIIT, ULP, Strasbourg. 



G. Bertrand, M. Couprie, L. Perroton (Eds.): DGCI’99, LNCS 1568, pp. 435—446, 1999. 
(c) Springer- Verlag Berlin Heidelberg 1999 
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For each intermediate screen pixel, follow the corresponding 
(if available) 3D ray inside of the data space from the front 
towards the back. If the back side is reached, then give the 
pixel a background color, else, if the object is touched, give 
the pixel the corresponding object color. 

Our algorithm, as well as all other discrete ray casting algorithms, is charac- 
terized by the fact that the complexity in time does not depend on the complex- 
ity of the scene, but only on the number of empty voxels to be scanned before 
reaching visible object voxels. This because there is no need to use heavily time- 
expensive intersection algorithms for each ray and for each object. The ray is 
stopped as soon as it reaches the object, the collision test consists in testing the 
voxel value. 

The paper is organized as follows. In section 2 and 3 we recall fundamental 
definitions of discrete geometry. Section 4 concentrates on the actual discrete 
ray-casting algorithm. 



2 Basic Definitions 



In this section, we recall some basic definitions from discrete geometry. They 
will be useful further on. Arithmetic Discrete Geometry has been introduced 
and studied in [17,1,13]. Discrete planes have been studied in [8,1,2]. 

The Discrete coordinate plane consists of unit squares, called pixels, centered 
on the integer points of the two-dimensional Cartesian coordinate system in the 
plane. 

The Discrete coordinate space consists of unit cubes, called voxels, centered 
on the integer points of the three-dimensional Cartesian coordinate system in 
the space. The pixels / voxels coordinates are the coordinates of their centers. 

Definition 1 (2D adjacency). Two pixels are 4-connected if they have a com- 
mon side. Two pixels are 8-connected if they share a common vertex or side. 

Definition 2 (3D adjacency). Two voxels are 6-connected if they have a com- 
mon face. Two voxels are 1 8-connected if they share a common edge or face. Two 
voxels are 26-connected if they share a common vertex, edge or face. 



Definition 3 (neighborhood). The set of all pixels adjacent to a given pixel 
P is called the neighborhood of P. 

Following definitions (4 and 1) of a 2D discrete line are equivalent. 

Definition 4 ([17]). A 2D discrete line is a set of pixels L{a, b, c, u) = 

{(x, y) G |0 < ax -F -F c < w }, where a,b,c G Z and w G N*. w is called 
the arithmetical thickness of the discrete line and c the translation constant. 



Result 1 ([17]) A pixel {x,y) G Z'^ belongs to the 2D discrete line L{a,b,c,uj) if 
ax-i-by-i-c ] _ ui/iere [r] , wit/i r G H, denotes the greatest integer not exceeding r. 
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Definition 5 (run). Along a 2D discrete line, functionnal over the X -coordi- 
nate, a set of consecutive pixels of same Y -coordinate is called a run. 

Using Result 1, it is easy to define the set Lj-, the discrete line parallel to 
L(a, &, c, w), by I (x, y) G k € Z. 

Proposition 1 ([17]). The set of all the tiles the plane. 

Definition 6 (naive line, [17]). When cj = max(|a|, |&|), the 2D discrete line 
L{a, b, c, to) is called a naive line. 

Proposition 2 ([17]). A naive line is 8-connected minimal (i.e. if one removes 
a single pixel, the line is no more 8-connected) . 

A special case of a naive line which appears to be one of the best discrete 
approximation to the corresponding continuous straight line among the set of 
all possible naive lines is the Bresenham line [4]. The Bresenham line was first 
described algorithmically, but we now have an analytical definition [17]. Let 
L{a, b, c, uj) be a naive line, passing through 0(0, 0), with c = [|^] , then this is a 
Bresenham line. 

As a Bresenham line L(a, 6, c, to) tiles the plane, Lk goes through any point 
{x, y) of the plane, where k = y). A discrete line is functional 

over the Ox (resp. Oy) coordinate axis if there corresponds one and only one 
pixel to any given coordinate x (resp. y) on this line. 

Proposition 3 ([3]). A 2D naive line is functional over the coordinate axis 
that makes an angle less than or equal to 45 degrees. 

Proposition 4. Let Lk{a,b,c,uj) be a naive line functional over X and b > 0. 
Given a coordinate x over X, there is only one pixel (x,y) belonging to L^: 
y = Y{Lk,x), where Y{Lk,x) = - . 

Similarly to the definition of discrete lines, following definitions (7 and 2) of 
a discrete plane are equivalent. 

Definition 7 (discrete plane, [17]). A discrete plane is a set of voxels P{a, b, 
c,d,uj) = {{x,y,z) G \0 < ax -\- by -\- cz -\- d < uj} where a,b,c,d G Z and oj G 
N*. Lv is called the arithmetical thickness of the discrete plane and d is a trans- 
lation constant. The vector (a,b,c) is a normal vector to the plane P. 

Result 2 (discrete plane, [17]) A voxel (x,y,z) G ZA belongs to the discrete 
plane P{a,b,c,d,Lo) if 

Using Result 2 it is easy to define the set Pk, the discrete plane parallel to 
P{a,b,c,d, to), by {(x,y,z) = fc }, where fc G Z. 

Following definitions and results are due to [1]. 




438 



Ivan Salam, Philippe Nehlig and Eric Andres 



Proposition 5. The set of all the tiles the space. 

Definition 8 (tunnels through a discrete plane). The plane P{a,b,c,d,uj) 
has a k-tunnel (k = 6, 18, 26) if there exist two k-adjacent voxels A{xa,VAtZa) 
and B{xB,yB, zb) such that axA + hyA+czA+d < 0 and axB+byB+cZB+d > uj. 

Definition 9 (naive plane, [1,2]). When cj = max(|a|, |6|, |c|), the discrete 
plane is called a naive plane. 

Proposition 6 ([1,2]). A naive plane is 18-connected. This is the smallest 
value of UJ for which there is no 6-tunnels. 

Proposition 7. A naive plane is functional over at least one of the coordinate 
planes Oxy, Oxz, or Oyz. 

3 Discrete Naive 3D Lines 

In this section we show some important new properties based on the definition 
of a discrete naive 3D line given by Figueiredo-Reveilles [7]. 

Their discrete naive 3D line is the subset of 7? defined by the intersection 
of two planes obtained by extrusion of two discrete Bresenham 2D lines. The 
properties of such a discrete naive 3D line are the following ones: 

— 26-connected. 

~ minimal (i.e. if one removes a single voxel, the line is no more 26-connected). 

— functional over (at least) 2 planes within Oxy, Oxz, or Oyz. 

Using this definition, the problem of generating a discrete 3D line is reduced 
to two 2D problems, since the selected discrete 3D line is characterized by two 
discrete 2D Bresenham lines: D\ and D 2 

Theorem 1. The two 2D Bresenham lines characterizing the naive 3D line are 
functional over the same axis, i.e. X. This axis is the common one to the two 
planes used to define the 3D line. 

Theorem 2. The set of all the lines parallel to a given naive 3D line defined by 
Figueiredo-Reveilles tiles the space. 

The proof can be deduced easily from the definition of a naive 3D line. 

Proof. Given a discrete naive 3D line L, it is characterized by its two projections: 

— (On plane Pi) Li(ai, 61, ci, wi) 

— (On plane P2) ^2(02, ^'2, C2, W2) 

Each point P{u,v,t) of the space can be projected on Pi and P2. The Bre- 
senham 2D line Li tiles the plane Pi, by definition. So there exists a unique 
ki G Z such that the projection of P in the plane Pi belongs to Pfe, . The same 
argument holds for L2 with respect to the plane P2. 

The lines Lk^ and Lk^ define a unique Figueiredo-Reveilles’ naive 3D line 
according to its definition. 
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4 Discrete Ray-Casting using Figueiredo-Reveilles’ Naive 
3D Lines 



In this section, we will examine how to develop a discrete ray-casting algorithm 
using the naive 3D lines defined previously and having the important property 
that each point of the scene is reached once (i.e. no voxel is missed and no 
voxel is reached more than once). This property holds only if we use a parallel 
projection and the definition of Figueiredo-Reveilles to define the rays, since 
Theorem 2 ensures that the set of all the lines parallel to a given discrete line 
tiles the space. 



4.1 Sketch of the algorithm 

Let C be the center of the parallelepipedic orthothetic discrete scene bounded 
by (0,0,0) and (rtmaxi i^max, fmax) that we want to visualize. 

As we are using parallel projection, the point of view is placed at infinity. To 
define the viewing direction, we will consider a point O on the line connecting 
this point of view to C, O being at finite distance of C. The algorithm is divided 
in two main sections: the initializations and the main loop. 

— The initialization process consists in (1) selecting the horizontal and the 
vertical planes for the 2D projections of the Figueiredo-Reveilles 3D lines. (2) 
The initialization process then computes the coefficients of the corresponding 
2D Bresenham lines in the above selected planes. (3) The last initialization 
step computes the 2D boundaries of the projection of the volume in order 
to restrict the subsequent 3D lines generation to the actual data volume. 

— The main loop consists in (1) computing the entry points into the data 
volume for all the 3D lines, then these lines are (2) incrementally generated 
until either they reach a none empty voxel or they exit from the data volume. 



4.2 Initializations 

1. To select the two 2D projection planes P\, and P 2 corresponding to the 
3D discrete line connecting O(uo,vo,to) to C(uc,Vc,tc), we compute: m = 
max(|Z\u|, |A„|, |At|), where Au = ug - Uc,Ay = vg - Vc,At = tg - p. If 
m = |Z\„| then (horizontal plane) Pi = Ouv and (vertical plane) P 2 = Out. 
If m = |Z\„| then (horizontal plane) P\ = Ouv and (vertical plane) P 2 = Ovt. 
If m = \At \ then (horizontal plane) P± = Ovt and (vertical plane) P 2 = Out. 

2. Let Dfcj (resp. D(.^), be the 2D projection of the discrete 3D Figueiredo- 
Reveilles in plane Pi (resp. P 2 ). We can suppose, without any loss of general- 
ity, that the slope of these 2D lines is in [0, 1] (other cases are symmetric). 

is defined by: a = Ay, b = —Ay, c = [^] — {aug + bvg + kiu)) , oj = max(|a|, |6|) 
(resp. is defined by: a = At, b = —Ay, c = [f ] — {aug + btg + k 2 Uj), 
to = max(|a|, |&|) ). 
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3. To know which rays must be sent through the data volume, we define the in- 
tervals in which k\ and k 2 are bounded, using the projections of the data vol- 
ume vertices. These intervals are: k\ G [fcimim ^imax] and ^2 G [femin, A2max] 

with: A:i niin — .^I)(0?Wmax)j max — ('^maxi 0) j ^2 min — (O5 '^max) j 

^2 max — (tmax? 0) ■ 

4.3 Main Loop 




Y 



Fig. 1. A Discrete Naive 3D Line Fig. 2. Intersection 



1 . For each discrete 3D ray, we compute the intersection with the data volume. 
As a Figueiredo-Reveilles’ line is characterized by two Bresenham lines, and 
as these two 2D lines are functional over the same axis (Theorem 1), the prob- 
lem of finding the 3D intersection is reduced to two 2D intersection compu- 
tations. Furthermore, as we assume that the two 2D lines are functional over 
the X axis, the entry point to the data volume is (u, F(Dfc^,M),T(D(.^,M)) 
with U G [0,'Uniax]j ^ G [0,Umax]j ^ [Ojtniaxjj ^ — 



aiu-\-ci — kibi 
bi 



, and 



u) 



Q2^ + C2 — ^2^2 

b2 



We now consider the problem in the plane Pi, (the similar problem in P 2 is 
not discussed here). As the current projected line in Pi lies within the first 
octant (hypothesis), Dk^ can only cut the projection of the 3D data volume 
across two edges: Edge 1: connecting (wmaxjO) to (wmaxji’max) or Edge 2: 
connecting (Umax, ^max) to (0,t>max)- 

In the first case, the target point is u = Umax even if the line has a long run 
as depicted on Figure 2. 

In the second case, we have to find the point (u, Umax)- If this point does not 
exist, which may happen since u must satisfy the constraint u G [0,M„iax[) 
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the intersection is void. This means that the corresponding 3D ray does 
not hit the volume. To compute this intersection point, if any, we calculate 
the discrete intersection of the line Li and the line v = Vmax [17]- This 
intersection is usually composed of more than one pixel, but in our case, we 
can be sure that they do belong to the same run. The intersection pixel is the 
first pixel according to the way of generating the discrete line. It is the pixel 
with the smallest remainder, since the remainder is growing incrementally 
along a given run. Futhermore this pixel can be found directly using the 
arithmetical definition of the discrete line. 

Using this method we obtain ui in the plane Pi such as {ui,V{Li,ui)) € 
box (0,0, Umax, ^^max)^ and U 2 in the plane P 2 such as (u 2 , T(L 2 , M 2 )) G 
box (0, 0, UmaxUmax)- Since Li and L 2 are functional over the same axis U, 
the entry point of the ray in the scene is defined by: {u,V{Li,u),T(L 2 ,u)), 
where u = min(ui,M 2 )- 

2. The Figueiredo-Reveilles’ naive 3D line generation algorithm is a straight for- 
ward consequence of the generation algorithm a 2D Bresenham Line. Given 
two points A and B of the space, without any assumption about their posi- 
tions, Algorithm 2 shows how to generate the Figueiredo-Reveilles’ naive 3D 
discrete segment [AR]. The first part of the algorithm determines in which 
octant B is, when A is considered as the center of the space. The horizontal 
plane and the vertical plane are chosen. Then two Bresenham algorithms are 
run: at each step, we consider the next coordinate over X, since the choice 
of the next coordinate over Y or Z depends on the remainder ri and T 2 . 



4.4 Discrete Z-Buffer 

The Ray-Casting method allows to determine the visible points in the scene, 
but to produce a correct picture with these data, we need to be able to compute 
the screen to object distance in order to produce an approximation to normal 
vectors. 

Most of well-known methods, such as Z-Buffer shading, need to know for a 
given point its distance to a fixed point: i.e. the screen or the point of view. As 
we are using a parallel projection, we can’t refer to such a fixed point. 

Here again, we are using a discrete approach. We refer to the point O, and 
we define a naive plane going through O and normal to the discrete 3D line OC. 
As the set Pk of all the naive planes parallel to a given one tiles the space, we 
can use k to define the depth of any voxel of the scene according to the plane 
going through O. 

This naive plan is defined by: 

a = Uc — uq, b = Vc — Vo, c = tc — to, d = [^] — {auo + bvo + cto), and 
w = max(|a|, |&|, |c|). 

Normal vectors are computed using forward differences in the discrete Z- 
Buffer, enhancements can be implemented here using [11,16]. 

^ box(a,b,c,d) denotes the rectangular 2D orthothetic discrete box of opposite vertices 
(a,b) and (c,d). 
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Algorithm 1 Algorithm generating Figueiredo- Reveilles’ naive 3D discrete line 
Required : A, B € Z® 

Ensures : procedure PLOT is called for each point of [AB], 

A[0] B[0] - A[0] 

A[l] ^ B[l] - A[l] 

A[2] B[2] - A[2] 

if zi[0] > zi[l] et zi[0] > A[2] then 

else if A[l] > A[0] et A[l] > A[2] then 

Xt-l;yt- 0 ;Z -«-2 

else if A[2] > A[0] et A[2] > A[l] then 

Xt- 2 ;yt-l;Zt -0 

end if 



pt A 




repeat 

PLOT(pt) 



i t— i + 1 

pt[X] t- pt[X] + sign (A[X]) 
ri t- n + |A[y]| 
if ri > |A[X]| then 
ri t- n - |A[X]| 
pt\Y] t- pt\Y] + sign (A[X]) 
end if 

C2 t- r2 + \A[Z]\ 
if V 2 > |A[X]| then 
C2 t- r2 — |A[X]| 
pt[Z] t- pt[Z] + sign (A[X]) 

end if 

until i > |A[X]| 



4.5 Geometric Correction 

The intermediate screen resulting from the ray-casting algorithm needs to be 
rescaled to fit the final rectangular screen. Although this geometric correction 
can easily be performed using conventional methods [10], we have implemented 
a discrete affine mapping method. This approach is based on the fact that the 
Bresenham 2D lines tile the plane, so we project runs of adequate lines over 
single pixels, in order to rescale the whole intermediate screen. 

5 Results 

Visual results are illustrated Table 3, and 4. Runtime comparisons are given 
between our program and “Bob” from Graphics and Visualization Lab of the 




Discrete Ray-Casting 443 



Army High Performance Computing Research Center, Aberdeen, MD (USA), 
a freeware conventional real number based visualization tool^, and “volume” a 
home made discrete visualization tool based on the scan of the hole discrete 3D 
data space volume. 





12^'^voxels 


IbO'^voxels 


bob 


n/a 


1 min 17 sec 


volume 


0.49 sec 


0.55 sec 


Discrete Ray Casting 


0.15 sec 


0.21 sec 



6 Concluding Remarks 

We presented a method for volume viewing exclusively based on the use of integer 
arithmetic. The originality of our algorithm is the use of Figueiredo-Reveilles’ 
3D discrete lines to model the rays, which ensures that each voxel of the scene 
is reached only once, since these rays tile the data volume. The front to back 
algorithm used ensures that only the voxels in front of the visible object are 
scanned. All this leads to interesting overall runtimes. This program can, in 
addition, be interfaced with any usual shading algorithm. 

Extensions of this work can be integrated into the rendering part of a discrete 
modeler. The discrete rays can be used in any other field, for example in radio 
wave propagation simulations. 
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Fig. 3. Twelve views of a Molecule 
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Fig. 4. Twelve views of a C60-Molecule 
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Abstract. Precise knowledge of normal vectors to discrete objects is 
mandatory in rendering algorithms. This article introduces a new method 
for the calculation of normal vectors to a digital object. This technique 
relies on discrete geometry theories : the recognition of discrete straight 
lines and tangential lines in dimension 2. Results obtained with synthetic 
and real objects from medical imagery are presented and commented. 



1 Introduction 

Well estimating the normal vectors of a digital object is primordial for the sim- 
ulation of lighting and special effects in relation with light on digital objects. 
But these objects are in general digitized and the normal vectors to volumes are 
unknown. 

Numerous authors have been interested in the calculation problem of normal 
vectors to a digital object by using miscellaneous approaches. A synthesis of 
the works undertaken on this subject until 1992 is presented in the article [15]. 
Since this, other approaches considering the 3D neighborhood of the points has 
appeared, one of them [13] uses the volume around the point where the normal 
vector is calculated. Another one [10] is based on a generalization of the umbrella 
notion [9, 4, 5, 6] at the vertex of a polygonal volume. 

The method proposed in this article takes his origin in a DEA practice [8] 
and considers that an integer point is represented by a voxel; a cube whose 
side is one unit and whose faces are surfels. The neighborhood of each point is 
considered according to its regularity, thus plane parts and holes are integrally 
rendered without any threshold. 

This work introduces a new estimation of the 3D discrete normal vectors 
based on the definitions of discrete straight lines and tangential lines in dimension 

2 given in [3,2,14,12,11]; to calculate the normal vector at one point to the edge 
of a digital object, we use the slices of this object along the three canonical 
planes through this point (see Fig. 1). On the outline of each slice, we calculate 
the tangential line through the considered point [14] then by combining the 2D 
tangential lines obtained (1, 2 or 3) and by applying a vectorial product, we 
obtain the 3D normal vector to the point of the voxel object. A surfel approach 
is also proposed with a similar method by calculating the normal vector at each 
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Fig. 1. Digitized mushroom and its slices 



surfel to the edge of the voxel object. Firstly, we recall the results of 2D discrete 
geometry necessary for the calculation of the 3D normal vectors. Then, we define 
and explicitly give the calculation of normal vectors to any voxel object according 
to two methods in 8-connexity for the calculation of a normal vector to a voxel 
of the edge then in 4-connexity for a surfel of the voxel object edge. Finally, we 
comment the results obtained with synthetic objects and real objects. 

2 2D Discrete Straight Lines and Tangential Lines 

We present in this section the basic notions of 2D discrete geometry useful for 
the determination of 3D normal vectors. 



2.1 Discrete Straight Lines 

The arithmetical definition of a discrete straight line has been introduced by JP. 
Reveilles [12,11] as follows : a discrete straight line whose slope is | with 
6 yf 0 and a A b = 1, with a lower bound fx, an arithmetical thickness to, is 
the set of points (x, y) of which satisfies the double diophantine inequation 
fj, < ax — by < g + to, where all parameters are integer. 

We call the preceding discrete straight line T>{a,b, In the following, we 

shall be interested in the naive straight lines which verify oj = sMp(|a|, |6|) 
and which are 8-connex and, in the standard straight lines which verify uj = 
joj + j&j and are 4-connex (see Fig. 2). In order to simplify the writing, we shall 
suppose in the following that the slope coefficients verify 0 < a < 6 yf 0. 

2.2 Recognition of Discrete Straight Line Segment 

Let us consider E a segment of T>, naive or standard straight line whose char- 
acteristics are a, b, g, lo with 0 < a < & yf 0 and I is the maximal value of the 
segment points on the x axis. Let us suppose that the point M(xm,2/m) (with 
xm = Z -b 1 or xm = 1) is added to S, \s E' = E U {M} a segment of the 
straight line and, in this case, what are the characteristics a' ,b',g'l This prob- 
lem is solved in [3,2] and relies on some particular points of a discrete straight 
line defined hereafter. 
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The real straight lines ax — by = fi and ax — by = /x + w — 1 are called leaning 
straight lines of the naive discrete straight line T>{a,b, g,uj). An integer point 
of these straight lines is called leaning point. 

The leaning straight line located above (resp. under) T> has the following 
equation ax — by = g (resp. ax — by = g + u> — 1), it is called upper leaning 
straight line (resp. lower leaning straight line) of V. Some leaning points 
are particularly important along the recognition of a straight line segment, they 
are the ones located at the extremities of the segment whose recognition is in 
progress. We note Ub (resp. Lb) the upper leaning point (resp. lower) whose 
value on the x axis is minimal. In the same way, we note Up (resp. Lp) the upper 
leaning point (resp. lower) whose value on the x axis is maximal (see Fig. 2). 




Fig. 2. On the left hand side, the naive straight line T>{5, 8, —1, 8) for x G [0, 15] 
and on the right hand side, the standard straight line T>{5, 8, 0, 13) for x G [0, 15], 
the upper leaning points of these straight lines are in dark gray and the lower 
leaning points in light gray. 



Theorem 1 ([3,2]). Let us consider r{M) , the remainder at the point M{x,y) 
as a function of V defined by : 



r(M) = ax — by 

(*) If h ^ r{M) < g + u>, then M G I?(a, b, g), S U {M} is a segment of the 
straight line T>. 

{ii) If r{M) = g—1, then SU {M} is a segment of the straight line whose slope 
is given by the vector UbM with Ub the upper leaning point ofV whose value 
on the X axis is minimal. 

{Hi) If r{M) = g + co, then S\J{M} is a segment of the straight line whose slope 
is given by the vector LbM with Lb the lower leaning point ofV whose value 
on the X axis is minimal. 

{iv) If r{M) < g — 1 or r{M) > g + to, then S U {M} is not a segment of a 
discrete straight line. 

This theorem allows to obtain an incremental algorithm of discrete straight line 
segment recognition [3,2] by scanning series of pixels 8 or 4-connex called dis- 
crete way. The algorithm in 8-connexity has been adapted by A.Vialard [14] 
in order to determine a notion of discrete tangential line. 
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2.3 Discrete Tangential Line at a Point to a 2D Curve Line 

The discrete tangential line at a point M to a discrete way C is defined [14] 
as the longest part of C centered to M which is a segment of a discrete straight 
line. 

The algorithm of discrete straight line segment recognition centered to a 
point is used in order to determine the longest discrete segment centered to M 
by adding at each step a point to each extremity. If the couple of added points 
does not permit to enlarge the segment, none of them is added. 

Addition of a point at the front extremity of a segment of the naive or standard 
straight line T>{a,b, of the first octant 
r = axM - byivi ; 

\fg<r<fj, + u! 

If r = /r then Up = M Eif 

lfr = /r + w— 1 then Lp = M Eif 

else 

If r = /r — 1 then 

Lb = Lp ; Up = M ; 
a = |yM - YubI ; b = |xm -xuJ ; 
fj, = axM - byM ; 

else 

If r = /X + w then 

Ub = Up ; Lp = M ; 
a = |yM -YlbI ; b = |xm -xlJ ; 
g = axM - byM - w + 1 ; 

else 

the new point cannot be added to the segment 

Eif 

Eif 

Eif 

To add a point at the rear extremity of the segment, the preceding algorithm 
must be used by exchanging Lp and Lb as well as Up and Ub. 

We define the right tangential line (resp. left) at a point M to a discrete 
way C as the longest part of C following (resp. preceding) the point M which is 
a segment of a discrete straight line. The calculation of the right (resp. left) tan- 
gential lines uses the algorithm to add a point at the front (resp. rear) extremity 
of a segment of a naive or standard straight line given above. 

3 Calculation of the 3D Normal Vectors at each Point of 
a Voxel Object Edge 

The calculation of the normal vectors at all points of a voxel object relies on the 
study of outlines associated with each point of the object located on the object 
slices through the considered point. We call slice in x (resp. y, z) at the 
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point M, with M{xmtVm,zm), the set of points of the object having the same 
co-ordinate xm (resp. ?/m, zm) (see Fig. 1). 

In the following, the slice at a point according to one co-ordinate may be 
viewed as an object in two dimensions. The outlines associated with an object 
slice are composed of the closed 8-connex series of pixels of the slice edges (see 
Fig. 5). 

We shall call associated pixel to M the 2D points corresponding to the 
projected points of M along the co-ordinate planes. Normal vectors at each point 
of the object are calculated as a function of the outlines which these points belong 
to. 

3.1 Voxel Position on the Edge of the Object 

We have voluntarily adopted a realistic viewpoint by wanting to calculate the 
best normal vector estimation to obtain an efficient rendering. Thus the voxel 
approach, which consist in considering an integer point as a cube with 6 faces, 
leads to problems due to the point position on the edge of the object. In the 
general case called regular cases, voxels possess 1, 2, or 3 faces called surfels, 
not opposite 2 to 2 others, on the edge of the object. In the other cases called 
non regular, we find two types of voxel configuration : 

— voxels having at least two opposite faces adjacent to the edge, ie. belonging 
to the ’’fine” locations of the object with a 1-voxel thickness, (Fig. 3 right) 

— corner or side voxels (Fig. 3 left) corresponding to the object discontinuities. 




Fig. 3. On the left hand side, in gray regular voxels and in white non regu- 
lar voxels. On the right hand side, an example of a fine object with a 1- voxel 
thickness. 

The different configurations are simply detectable in dimension 2 on the outlines 
calculated at the M considered point. Let us suppose that each outline is coded 
through a Freeman code [7]. For each outline through M, we note ds (resp dp) 
the direction which represents the move to the following (resp. preceding) pixel 
associated to M on the outline according to the trigonometric way. 

The general configuration of the voxel M is if, in each slice through M, 
there is at the most one outline through a pixel associated to M and if : 
ds = dp + 3[8] or dp = dp + 4[8] or dp = dp + 5[8] 

In the other case. 
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— if, in a same slice, there are several outlines through a pixel associated to 
M, the point M is located at a ’’fine” location of the object, 

— if, on an outline through M, the directions ds and dp are not compliant, 
the voxel object shows a discontinuity at this location. 



3 




5 



Fig. 4. On this figure, dp = ^^ the only possible back directions to be in the 
general configuration are ds = 3, 4 or 5. 



Examples of these three point types are shown on one of the outlines asso- 
ciated to a slice of the digitized mushroom of Fig. 5. For instance, at the pixel 
R, dp = 0 and dp = 7 are two directions which are not compliant, thus there 
is a discontinuity at the related point. It is clear that no unique normal vector 
may be assigned to a voxel in a non regular configuration, the decomposition 
in surfels of this voxel must then be considered. Normal vector calculation for 
these different configurations are described in the following paragraph. 




Fig. 5. The outlines of that mushroom slice are in dotted lines. Pixels L to M 
and M to N are non regular pixels because there are two outlines through these 
points; it is the same for pixels O, P, Q and R because directions dp and dp 
associated to these pixels are not compliant. The other pixels of the outlines (in 
gray color) are regular. 



3.2 Definitions 

In the general case, we define normal vector at the point M of the voxel 
object edge as the sum of (one, two or three) vectors obtained through the 
following steps : 
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— seek in each slice associated to the point M, on the outline through M, the 
2D tangent vector to the pixel associated to M, 

— if 2 or 3 tangent vectors are obtained, compute the vectorial product of 
these vectors, if not take the perpendicular vector to the only tangent vector 
obtained. 

The research of the 2D tangent vectors needs several phases : 

(z) Determine the 3 slices which the point M belongs to. 

(zz) Seek, in the list of outlines associated to each of these 3 slices, the outlines 
through M. 

(zzz) For each outline through M , consider this outline as a discrete way and 
calculate the tangent line to the pixel associated to M by using the algorithm 
in 8-connexity described in Sect. 2. 

The normal vector obtained is then associated to each surfel of the voxel belong- 
ing to the edge of the object. 

In the non regular cases, the research of tangent vectors and the allocation 
of the normal vectors to the surfels are different; surfels of the edge of a same 
voxel will not have the same normal vector. Thus three tangent lines (one by 
slice) may be associated to each surfel of a voxel, the research algorithm of the 
normal vectors to the surfels is the following one : 

In the three slices which the point M belongs to, 

— If the pixel associated to M is isolated, the standard tangent lines are as- 
signed to each of these sides then forwarded to the corresponding surfels (see 
the study of point B in the plane Oxz of Fig. 6). 

— If not, for each outline through a pixel associated to M, 

• If the outline has a discontinuity at this point (dp and dp not compliant), 
calculate the left and right tangent lines to the considered pixel by using 
the algorithms in 8-connexity described in Sect. 2. The allocation of 
the tangent lines to the pixel sides then to the corresponding surfels is 
undertaken in the following way : 

* The surfel which corresponds to the side adjacent to the direction 
(dp) in the trigonometric way inherits the half-tangent line which 
corresponds to the front half-outline. 

* Symmetrically, the surfel which corresponds to the side adjacent to 
the direction (dp ) in the inverse trigonometric way shall get the 
value of the tangent line associated to the rear half-outline. 

* Finally if, by scanning the set of directions in the trigonometric way 
from dp to dp, one of the pixel sides is on the edge of the object and 
does not possess any tangent line in this slice, assign to it the sum 
of both half-tangent lines oriented in the same way. 

• If not, the outline has no discontinuity at this point, calculate the tangent 
line to the pixel associated to M by using the algorithm described in Sect. 
2. This tangent line is associated to each surfel (represented by a side of 
the pixel) included in the part of the plane associated to this outline. 
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Fig. 6. Study of two voxels A and B non regular on a voxel object. On the left 
hand side, the voxel representation of this object with its slices. On the right 
hand side, the tangent lines obtained on each outline for the points A and B. 



As a function of the tangent lines assigned to the surfels, the normal vectors to 
the surfels are calculated by vectorial products. 

Remarks 

1 . The non regular cases only represent a very small part of the synthetic or real 
voxel objects; results are given in section 5. But taking into account these 
particular cases allows to obtain a very realistic rendering on the extremities 
of the objects even in case of irregularities (see Fig. 8). 

2. Trials have been done by using variants [14] of the discrete tangent line 
definition given in paragraph 2 : 

— Reduced tangent line : points of the segment extremities are deleted. 

— Asymmetrical tangent line : prolongation of the tangent line at one ex- 
tremity. 

— Asymmetrical reduced tangent line : combination of the two preceding 
definitions. 

These different definitions have not given satisfactory results and do not bring 
any improvements to the first definition. 



4 Calculation of the 3D Normal Vectors at each Surfel 
on the Edge of a Voxel Object 

After the definition of the preceding chapter, it seems to be natural to be inter- 
ested in a surfel approach, by calculating the normal vector at each surfel on the 
edge of a voxel object. The general idea remains identical but a few supplemen- 
tary definitions are necessary : The pixel sides are called lignels and its vertices 
are called pointels. The pointel outlines associated to a slice of the object 
are composed of the 4-connex series of pointels of pixels on the edge of the slice 
(see Fig. 7). The normal vector at a surfel on the edge of a voxel object is 
the vector obtained after the following steps : 
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— Seek the two slices it belongs to. For instance if the surfel is parallel to the 
plane Oxy, both slices shall be parallel to the planes Oxz and Oyz. 

— Determine for each slice the pointel outline through the lignel associated to 
the surfel, then calculate the discrete 4-connex tangent lines to each pointel 
which delimits the lignel associated to the considered surfel. The tangent 
line to the lignel is obtained by calculating the average tangent line to the 
pointels. 

— The vectorial product which is the result of the tangent lines calculated in 
each slice is the normal vector to the surfel. 




Fig. 7. Study of two surfels A and B on a voxel object. On the left hand side, 
the voxel representation of this object with its slices. On the right hand side, 
the tangent lines to the pointels obtained on each pointel outline for the surfels 
A and B. 



Remarks 

1. For each surfel on the edge, four calculations of the discrete tangent lines are 
necessary (only one is necessary in the voxel approach in the regular cases). 

2. The proposed method may always be applied, there is no particular cases. 

3. Two tangent lines are assigned for each surfel ; in the preceding method, up 
to three tangent lines might be assigned. 

5 Results 

The evaluation of the normal vectors is realized in two ways ; on synthetic objects 
by comparing the obtained normal vectors to the theoretical normal vectors and 
by visualizing them on digitized objects. The Figs. 9 to 15 show in the left hand 
side the voxel object and in the right hand side the rendering obtained thanks 
to the normal vectors calculated at each voxel of the object. 

We have compared the normal vectors obtained to the theoretical normal 
vectors calculated as follows: 

— A plane P{a,b,c, is defined by the set of integer points {x,y,z) which 
verifies y, < ax + by + cz < y, + oj where all parameters are integer. The 
theoretical normal vector is the vector (a, 6, c). 
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— A sphere with an integer radius r is defined by the integer points (x,y,z) 
which verifies x'^ + y'^ + z'^ < r(r+l) [1]. The theoretical normal vector is the 
vector starting from the center of the sphere through the voxel (resp. surfel) 
centers in the case of the voxel (resp. surfel) approach. 

— The theoretical normal vector to a voxel (resp. surfel) of a cube is the normal 
vector to the face it belongs to. 

The angle between the theoretical normal vector and the estimated one is cal- 
culated at each point of the tested objects. The average value and standard 
deviation of these values are presented in the following table according to the 
methods presented in Sects. 3 and 4. We observe more important errors with the 
second method on objects as the cube (see Fig. 8); corners are smoothed. 



Table 1. Discrete normal vectors vs theoritical normal vectors 



8-connexity method (Sect. 3) 


4-connexity method (Sect. 4)| 


Object 


Avg. error 


Std. deviation 


Avg. error 


Std. deviation 


Sphere radius = 5 


4.83 


3.63 


6.95 


5.23 


Sphere radius = 15 


2.87 


2.20 


3.01 


1.86 


Sphere radius = 25 


2.84 


2.24 


3.22 


1.92 


Cube side =15 


0.00 


0.00 


8.18 


4.82 


Plane (l,2,-5,-10) 


1.78 


0.00 


1.78 


0.00 


Plane (3, 4,-5, -5) 


1.02 


0.00 


1.02 


0.00 



In order to estimate the cost of both methods, the following table gives, 
for each presented object, the resolution, the number of voxels and surfels of 
the edge, the number of non regular voxels as well as the computing time of 
Freeman’s codes of the object slices and computing time of the normal vectors. 
Computing time has been obtained on a Silicon Graphics RIOOOO 195 Mhz. 
Moreover the pre-processing step for the calculation of object slices outlines has 
not been optimized yet. 



Table 2. Computation times 



Object 


Resolution 


Edge voxels 


Surfels 


Non regular 


Outlines 


Normals 


Blood veins 


266x285x241 


40912 


89238 


11232 


438 s 


40 s 


Brain 


128x128x128 


32259 


60052 


16907 


47 s 


96 s 


Head 


128x128x128 


36108 


62006 


1910 


15 s 


53 s 


Mushroom 


86x100x56 


15253 


26392 


714 


2 s 


14 s 


Knight 


249x157x156 


121540 


246714 


69026 


47 s 


251 s 


Sphere 


50x50x50 


6342 


11646 


0 


0.6 s 


4 s 


Sphere 


100x100x100 


25746 


46950 


72 


5 s 


28 s 


Sphere 


200x200x200 


103734 


188382 


0 


50 s 


198 s 
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6 Conclusion 

We have presented an original, simple and efficient method for the calculation 
of the normal vectors to a voxel object. It relies on strong theoretical bases, the 
arithmetic definitions of discrete straight lines. Numerous examples have been 
tested by using real data coming from medical imagery; the algorithms used 
have proven their consistency and quickness. They are currently used by voxel 
object handlers. Nevertheless an accurate comparative study between the differ- 
ent methods existing in the literature [10,13,15] still needs to be done to estimate 
which one shall be the most efficient. 
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Fig. 8. Rendered in 4 and 8 connexity Fig. 9. Sphere 50x50x50 




Fig. 10. Knight 39x25x25 Fig. 11. Knight 249x157x156 




Fig. 12. Brain 128x128x128 Fig. 13. Head 128x128x128 




Fig. 14. Mushroom 86x100x56 Fig. 15. Vein of human liver 

266x285x241 
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